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Chapter 1 

Financial Introduction 


In this section we’ll discuss some of the basic ideas of option pricing. The main idea is 
replication, whereby a derivative security can be priced because it is shown to be equivalent 
in value to a portfolio of assets that already have a price-tag. Another word to describe such 
a notion of pricing is benchmarking, where to say that you’ve ‘benchmarked the security’ 
might mean that you’ve found a portfolio of other assets that do not replicate but will 
remain very close in value no matter what outcome occurs. 

The idea of finding a portfolio that is remains close in value to the derivative security 
is essentially the law of one price, which states that “In an efficient market, all identical 
goods must have only one price.” Indeed, we will assume that our markets are efficient, 
and in some cases we will assume that arbitrage has zero probability of occurring; these 
assumptions are routine and are generally not considered to be restrictive. 

The manner in which these notes can be considered ‘oversimplified’ is in the complete¬ 
ness of the resulting markets. In practice there are derivatives (e.g. insurance products) 
which cannot be hedged, and hence the market is incomplete. Both the discrete time- 
space market and the Black-Scholes market are simple enough for completeness to hold, 
but reverse-engineering these models from real-life market data will require interpretation. 

1.1 A Market in Discrete Time and Space 

Consider a very simple market where at time t = 0 it is known that there are only two 
possible states for the market at a later time t = T. In between times t = 0 and t = T there 
is no trading of any kind. This market is described by a probability space P = { 001 , 012 } 
with probability measure P given by 

Pl = P(W 1 ) = | , P2= P(W 2 ) = ^ • 

The elementary events oo\ and 002 are the two states of the market. The traded assets in 
this simple market are a bank account, a stock, and and a call option on the stock with 
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exercise at T and strike K = 2. The market outcomes are shown in Table o 


Table 1.1: Assets in the Discrete Time-Space Market 


time 

t=0 


t=T 

bank account 

Bq = 1 

Bt = 1 (interest rate r = 0) 

stock 

cn 

II 

S T = < 

"3 in oj\ 

1 in u >2 

call option, K = 2 

II 

•-o 

Ct = 

' 1 in 

. 0 in U 2 


The way to determine the price of the call option, Co, is to replicate it with a portfolio 
of the stock and the bank account. Let Vt denote the value of such a portfolio at time t, 
so that 


Vo = aSo + P 

Vt = aSx + P 

where a is ff of shares and P is $ in bank. The portfolio Vt will replicate the call option if 
we solve for a and P so that Vt = Ct for both uq and U 2 - 

3 a + P = aS , T(wi) + P = Ct(w i) = 1 
a T P = aSr(oh) + P = Ct{^2) = 0 . 

This system has solution a = P = — Hence, by the law of one price, it must be that 

Co = r 0 = i So -i = l. 

If Co 7 ^ Vo, then there would be an arbitrage opportunity in the market. Arbitrage in 
a financial models means it is possible to purchase a portfolio for which there is a risk-less 
profit to be earned with positive probability. If such an opportunity is spotted then an 
investor could borrow an infinite amount of money and buy the arbitrage portfolio, giving 
him/her an infinite amount of wealth. Arbitrage (in this sense) does not make for a sound 
financial model, and in real life it is known that true arbitrage opportunities disappear very 
quickly by the efficiency of the markets. Therefore, almost every financial model assumes 
absolute efficiency of the market and ‘no-arbitrage’. There is also the concept of ‘No Free 
Lunch’, but in discrete time-space we do not have to worry about theses differences. 

The notion of ‘no-arbitrage’ is defined as follows: 

Definition 1.1.1. Let wealtht denote the wealth of an investor at time t. We say that the 
market has arbitrage if 

P {wealthT > 0\wealtho = 0) > 0 
and P (wealthT < 0\wealtho = 0) = 0 
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i.e. there is positive probability of a gain and zero probability of a loss. 


In our discrete time-space market, if Co < Vo then the arbitrage portfolio is one that 
buys that option, shorts the portfolio, and invest the difference in the bank. The risk-less 


payoff of this portfolio is shown in Table 1.2 


Table 1.2: Arbitrage Opportunity if Option Mispriced 



t = 0 

t = T 

buy option 

-Co 

max{SV — 2,0} 

sell portfolio 

Vo 

— max {Ar — 2,0} 

net: 

Vo — Cq > 0 

0 


1.2 Equivalent Martingale Measure (EMM) 

A common method for pricing an asset is to use a risk-neutral or an equivalent martingale 
measure (EMM). The EMM is convenient because all asset prices are simply an expectation 
of the payoff. Bu the questions are: what is the EMM? Is there more one? 

Definition 1.2.1. The probability measure Q is an EMM of P if St is a Q-martingale, 
that is 

E q S t = S 0 

and Q is equivalent to P, 

P(w) > 0 Q(u;) > 0 

for all elementary events co € fi (it’s more complicated when Q is not made up of elementary 
events, but it's basically the same idea). 

For our discrete time-space example market, we have 

• Under the original measure 

ESt = PiSt(wi) +P2St(w2 ) = ^3 + -1 = - > 2 = 5° 

• Under an EMM Q(cji) = q\ and Q(co 2 ) = <72, 

E q St = qi3 + (1 - <7i)l = 2 = So . 

Solution is q\ = \ and c/2 = 1 — Qi = \ ■ 
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Given the EMM, a replicable option is easily priced: 

E q C t = E q V t = E Q [aS T + P] 

= aE Q S T + p = aS 0 + P = Vo = C 0 , 
and so Co = E^ max(5r — K, 2). 

1.3 Contingent Claims 

A corporation is interested in purchasing a derivative product to provide specific cash¬ 
flows for each of the elementary events that (they believe) the market can take. Let 
0 = , wat} be these elementary events that can occur at time t = T, and let the 

proposed derivative security be a function C't(w) such that 

C T (uJi) =Ci Vi < N , 

where each Cj is the corporations desired cash flow. The derivative product C is a contingent 
claim , because it pays a fixed amount for all events in the market. Below are some general 
examples of contingent claims: 

Example 1.3.1. European Call Option. At time T the holder of the option has the 
right to buy an asset at a pre-determined strike price K. This is a contingent claim that 
pays 

(St(w) — K) + = max(Sr(w) — K, 0) 

where St is some risk asset (e.g. a stock or bond). The payoff on this call option is seen 
in Figure 

Example 1.3.2. European Put Option. At time T the holder of the option has the 
right to sell an asset at a pre-determined strike price K. This is a contingent claim that 
pays 

(.K — St(oj)) + = max(lC — St{oj), 0) . 

Figure \1.S\ shows the payoff for the put. 

Example 1.3.3. American Call/Put Option. At any time t < T the holder of the 
option has the right to buy/sell an asset at a pre-determined strike price K. Exercise of this 
claim at timet < T is contingent on the event At = {wj| it is optimal to exercise at time t} C 

n. 

Example 1.3.4. Bermuda Call/Put Option. At either time T/2 or time T, the holder 
of the option has the right to buy/sell an asset at a pre-determined strike price K. Similar 
to an American option except only one early strike time. 
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Figure 1.1: The payoff on a European call option with strike K = 50. 





Figure 1.2: The payoff on a European put option with strike K = 50. 
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Example 1.3.5. Asian Call Option. At time T the holder of the option has the right 

to buy the averaged asset at a pre-determined strike price K, 

where tg = l for integer M > 0. 

Example 1.3.6. Exchange Option. At time T the holder of the option has the right to 

buy one asset at the price of another, 

(S'y(w) - 5|>(w)) + 

where Sj is the price of an asset and S) 2 is the price of another. 

1.4 Option Pricing Terminology 

The following is a list of terms commonly used in option pricing: 

• Long position, a portfolio is ‘long asset X’ if it has net positive holdings of contracts 
in asset X. 

• Short position, a portfolio is ‘short asset X’ if it has net negative holdings of 
contracts in asset X (i.e. has short sales of contracts). 

• Hedge, or ‘hedging portfolio’ is a portfolio that has minimal or possibly a floor on 
the losses it might obtain. 

• In-the-money (ITM), a derivative contract that would have positive payoff if set¬ 
tlement based on today’s market prices (e.g. a call option with very low strike). 

• Out-of-the-money (OTM), a derivative contract that would be worthless if set¬ 
tlement based on today’s market prices (e.g. a call option with very high strike). 

• At-the-money (ATM), a derivative contract exactly at it’s breaking point between 
ITM and OTM. 

• Far-from-the-money, a derivative contract with very little chance of finishing ITM. 

• Underlying, the stock, bond, ETF, exchange rate, etc. on which a derivative con¬ 
tract is written. 

• Strike, The price upon which a call or put option is settled. 

• Maturity, the latest time at which a derivative contract can be settled. 

• Exercise, the event that the long party decides to use a derivative’s embedded option 
(e.g. using a call option to buy a share of stock at lower than market value). 


10 



1.5 Completeness &; Fundamental Theorems 


The nice thing about the discrete time-space market is that any contingent claim can be 
replicated. In general, for 0 = {uj i, W 2 , •.., wjv}> and with N— 1 risky-assets (S 1 , S 2 ,..., S N ~ l ) 
and a risk-free bank account (with r = 0), replicating portfolio weights the contingent claim 
C can be found by solving 


(i 

i 40w 2 ) 

\1 4( w 4 


4'V i) 


4 1 (ujn)J 



f p \ 


/ Cl ' 


OL\ 

= 

C2 


VnJ 


which has a solution (/3,a i,..., ajy_i) provided that none of these assets are redundant 
(e.g. there does not exist a portfolio consisting of the first N — 2 assets and the banks 
account that replicated the S ^ _1 ). This TV-dimensional extension of the discrete time- 
space market serves to further exemplify the importance of replication in asset pricing, 
and should help to make clear the intentions of the following definition and theorems: 


Definition 1.5.1. A contingent claim is reachable if there is a hedging portfolio V such 
the Vt(uj) = C(u) for alluj, in which case we say that C can be replicated. If all contingent 
claims can be replicated, then we say the market is complete. 


Theorem 1.5.1. The 1st fundamental theorem of asset pricing states the market is 
arbitrage-free if and only if there exists an EMM. 


Theorem 1.5.2. The 2nd fundamental theorem of asset pricing states the market is 
arbitrage-free and complete if and only if there exists a unique EMM. 

One of the early works that proves these theorems is [Harrison and Pliska, 1 981] . An¬ 
other good article on the subject is |Schachermayer, 19921. In summary, these fundamental 
theorems mean that derivatives are just the expectation of some EMM. Therefore, if we 
have the EMM then we can say things about the risk of a derivative. In fact, even if our 
EMM is an oversimplification of real-life markets, reverse engineering the model to find out 
the EMM parameters can still tell us something. 
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Chapter 2 

Brownian Motion 8z Stochastic 
Calculus 


Continuous time financial models will often use Brownian motion to model the trajec¬ 
tory of asset prices. One can typically open the finance section of a newspaper and see a 
time series plot of an asset’s price history, and it might be possible that the daily move¬ 
ments of the asset resemble a random walk or a path taken by a Brownian motion. Such 
a connection between asset prices and Brownian motion was central to the formulas of 
Bla ck and Scholes, 1973| and |Merton, 1973| , and have since led to a variety of models 
for pricing and hedging. The study of Brownian motion can be intense, but the main 
ideas are a simple definition and the application of Ito’s lemma. For further reading, see 
|Bjork, 2000 , |0ksendal, 2003 . 


2.1 Definition and Properties of Brownian Motion 

On the time interval [0, T], Brownian motion is a continuous stochastic process ( Wt)t<T 
such that 

1 . Wo = 0 , 

2. Independent Increments: for 0 < s' < t! < s < t < T, Wt — W s is independent of 
Wv-Wsf, 

3. Conditionally Gaussian: Wt — W s ~ jV(0 ,t — s), i.e. is normal with mean zero and 
variance t — s. 

There is a vast study of Brownian motion. We will instead use Brownian motion rather 
simply; the only other fact that is somewhat important is that Brownian motion is nowhere 
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differentiable, that is 


P (Wt is undefined for almost-everywhere t £ [0, T]^ = 1 , 

although sometimes people write Wt to denote a white noise process. It should also be 
pointed out that Wt is a martingale, 

E{W t \(W T )r< s } = W s Vs<t. 
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Figure 2.1: A sample of 10 independent Brownian motions. 

Simulation. There are any number of ways to simulate Brownian motion, but to under¬ 
stand why Brownian motion is consider a ‘random walk’, consider the process 

w N = w N + 1 ^ T J N with P robabilit y V 2 

tn tn ~ 1 1 i \frjN with probability 1/2 

with Wt 0 = 0 and t n = rijj for n = 0,1, 2 ..,, N. Obviously II 7 // has independent incre¬ 
ments, and conditionally has the same mean and variance as Brownian motion. However 
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it’s not conditional Gaussian. However, as N —> oo the probability law of converges to 
the probability law of Brownian motion, so this simple random walk is actually a good way 
to simulate Brownian motion if you take N large. However, one usually has a random num¬ 
ber generator that can produce that also has conditionally Gaussian increments, and 
so it probably better to simulate Brownian motion this way. A sample of 10 independent 


Brownian Motions simulations is given in Figure 2.1 


2.2 The Ito Integral 

Introductory calculus courses teach differentiation first and integration second, which 
makes sense because differentiation is generally a methodical procedure (e.g. you use 
the chain rule) whereas finding an anti-derivative requires all kinds of change-of-variables, 
trig-substitutions, and guess-work. The Ito calculus is derived and taught in the reverse 
order: first we need to understand the definition of an Ito (stochastic) integral, and then 
we can understand what it means for a stochastic process to have a differential. 

The construction of the Ito integral begins with a backward Riemann sum. For some 
function / : [0, T] —> M (possibly random), non-anticipative of W, the Ito integral is defined 
as 

pT N-l 

i f(t)dW t =\im'£f(t n )(W tn+1 -W tn ) (2.1) 


n =0 


where t n = with the limit holding in the strong sense. Through an application of 


Fubini’s theorem, it can be shown that equation (2.1) is a martingale, 

(*T " rs 

f(t)dWt (W T ) T <s 


E 


Go 


= / f(t)dW t 
J o 


Vs < T . 


Another important property of the stochastic integral is the Ito Isometry, 

Proposition 2.2.1. (ltd Isometry). For any functions f,g (possibly random), non- 
anticipative ofW, with E f Q f 2 (t)dt < oo and E f Q g 2 (t)dt < oo, then 

E f(t)dW t '\ (^j\(t)dW^j =E £ f(t)g(t)dt . 

Some facts about the Ito integral: 


One can look at Equation (2.1) and think about the stochastic integral as a sum of 


independent normal random variables with mean zero and variance T/N , 
N—l , „ 

E /(*n)(W tB+ ! - W tn ) ~ M 0 , - E / 2 (^ 


71—0 


Therefore, one might suspect that f(t)dWt is normal distributed. 
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• In fact, the Ito integral is normally distributed when / is a non-stochastic function, 
and its variance is given by te Ito isometry: 



f(t)dW t 


r\j 


■Au o , 



• The Ito integral is also defined for functions of another random variable. For instance, 
/ : M —> K and another random variable Xt, the Ito integral is 



f{X t )dWt 


N -1 


lim 

TV—>-oo 


E f( X tn)(W tn+1 

n =0 


w t J . 


The Ito isometry for this integral is 

E (£ f(X t )dW t ) = j\f 2 {X t )dt , 
provided that X is non-anticipative of W. 


2.3 Stochastic Differential Equations &; Ito’s Lemma 

With the stochastic integral defined, we can now start talking about differentials. In 
applications the stochastic differential is how we think about random process that evolve 
over time. For instance, the return on a portfolio or the evolution of a bond yield. The 
idea is not that various physical phenomena are Brownian motion, but that they are driven 
by a Brownian motion. 

Instead of derivatives, the integrands in Ito integrals satisfy SDEs. For instance, 


dSt = fiStdt + aStdWt geometric Brownian motion, 

dY t = k(6 — Yt)dt + 'ydWt an Ornstein-Uhlenbeck process, 
dX t = a(X t )dt + b(X t )dWt a general SDE. 

Essentially, the formulation of an SDE tells us the Ito integral representation. For instance, 

a(X s )ds + [ b(X s )dW s . 

Jo 

Hence, any SDE that has no di-terms is a martingale. For instance, if dSt = rStdt+aStdWt , 
then Ft = e~ rt St satisfies the SDE dFt = aFtdWt and is therefore a martingale. 

On any given day, mathematicians, physicists, and practitioners may or may not recall 
the conditions on functions a and b that provide a sound mathematical framework, but 
the safest thing to do is to work with coefficients that are known to provide existence and 
uniqueness of solutions to the SDE (see page 68 of |0kscndal, 2003|). 


dX t = a(X t )dt + b(X t )dW t 


^ x t = x 0 + 
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Theorem 2.3.1. (Conditions for Existence and Uniqueness of Solutions to 
SDEs). For 0 < T < oo, let t E [0, T] and consider the SDE 

dX t = aft, X t )dt + b(t, X t )dW t , 

with initial condition Xo = x (x constant). Sufficient conditions for existence and unique¬ 
ness of square-integrable solutions to this SDE (i.e. solutions such that E J Q T \X t \ 2 dt < oo) 

are linear growth 

| aft, x)\ + | b(t, x)| < C(1 + |x|) Vx E M and Vf E [0, T] 
for some finite constant C > 0, and Lipschitz continuity 


|a(L x) - aft , y)\ + \b(t, x ) - b(t, y )| < D\x - y\ 
where 0 < D < oo is the Lipschitz constant. 


Vx, j/GM and Vi E [0, T] 


From the statement of Theorem 2.3.1 it should be clear that these are not necessary 
conditions. For instance, the widely used square-root process 

dX t = k{X - X t )dt + 7 s/x t dW t 


does not satisfy linear growth or Lipschitz continuity for x near zero, but there does exist 
a unique solution if 7 2 < 2Xn. In general, existence of solutions for SDEs not covered by 
Theorem 2.3.1 needs to be evaluated on a case-by-case basis. The rule of thumb is to stay 
within the bounds of the theorem, and only work with SDEs outside if you are certain that 
the solution exists (and is unique). 


Example 2.3.1 (Tanaka Equation). The canonical example to demonstrate non-uniqueness 
for non-Lipschitz coefficients is the Tanaka equation, 


dX t = sgn{X t )dW t , 

with Xo = 0, where sgn(x ) is the sign function; sgnfx ) = 1 if x > 0, sgn(x) = — 1 if x < 0 
and sgn(x) = 0 if x = 0. Consider another Brownian motion Wt and define 

W t = f sgn(W 8 )dW 3 , 

Jo 

where it can be checked that Wt is also a Brownian motion. We can also write 

dW t = sgn{W t )dWt , 

which shows that X t = Wt is a solution to the Tanaka equation. However, this is referred 
to as a weak solution, meaning that the driving Brownian motion was recovered after the 
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solution X was given; a strong solution is a solution obtained when first the Brownian 
motion is given. Notice this weak solution is non-unique: take Yt = —Xt and look at the 
differential, 

dY t = -dX t = -sgn(X t )dWt = sgn{Y t )dW t . 

Notice that Xt = 0 is also a solution. 


Given a stochastic differential equation, the Ito lemma tells us the differential of any 
function on that process. Ito’s lemma can be thought of the stochastic analogue to differ¬ 
entiation, and is a fundamental tool in stochastic differential equations: 


Lemma 2.3.1. (ltd Lemma). Consider the process Xt with SDE dXt = a{Xf)dt + 
b{X t )dWt. For a function f(t, x) with at least one derivative in t and at least two derivatives 
in x, we have 




fit, Xt)dt + b(X t )-^f(t, Xf)dWt 


( 2 . 2 ) 


Details on the Ito Lemma. The proof of (2.2) is fairly involved and has several details 
to check, but ultimately, Ito’s lemma is a Taylor expansion to the 2nd order term, e.g. 


fiWt) ~ f{W t0 ) + f'iW t0 )(Wt -W t0 ) + - 2 f(W t0 )(Wt - w t0 ) 2 

for 0 < t — to 1, (i.e. t just slightly greater than to). To get a sense of why higher 
order terms drop out, take to = 0 and t n = nt/N for some large N, and use the Taylor 
expansion: 

N—i N -1 

f(Wt) - /(Wo) = X] f'(Wt n )(W tn+1 - Wt J + - £ f"(Wt n )(W tn+1 - WtJ 2 

n=0 n =0 

N—l N -1 

+ 6 E WJ(W ^ +1 - W t J 3 + -£ f""iin){W tn+1 - WtJ 4 , 

n= 0 n=0 


where £ n is some (random) intermediate point to make the expansion exact. Now we use 
independent increments and the fact that 

E(W tn+1 - WtJ fc = | (x) k/2 (k ~ 1)!' ifk even 
from which is can be seem that the f and f” are are significant, 


(N -1 


N—l 


E Y,f'( W ^(. W tn + l- W tr, 


K 71 = 0 

N—l 


N 


^E|/(WtJ| 2 = 0(l) , 


71=0 

N -1 


E E f"iWt n )(W tn+1 - WtJ 2 = -E EAWO = 0 ( 1) , 


71=0 


71=0 
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and assuming there is some bound M < oo such that \f"'\ < M and \f""\ < M, we see 
that the higher order terms are arbitrarily small, 


E 


N -1 


E f"\W tn )(W tn+1 - WtJ 


n =0 


N—l 

<M^E\W tn+1 -W tn 

n =0 
N—l 


< m E\/ e \ w t n+1 ~ w tS = o(^ 


E 


N -1 


E f""(Hn)(W tn+1 - W t J 


n =0 


n=0 

1V-1 


N 1 / 2 


< 


M E E(Wi 


tn+1 


-W,) 4 = 0[ ^ 


n=0 


where we’ve used the Cauchy-Schwartz inequality, E|Z| 3 < Y^E|Zp for some random- 
variable Z. 


The following are examples that should help to familiarize with the Ito lemma and solutions 
to SDEs: 

Example 2.3.2. An Ornstein-Uhlenbeck process dXt = k(6 — Xf)dt + 'ydWt has solution 

X t = 6 + (X 0 - 9)e~ Kt + 7 f* e-^-^dWs . 

Jo 


This solution uses an integrating facto^\ of e Kt , 

dX t + kX fdt = nddt + 'ydWt ■ 

Example 2.3.3. Apply ltd’s lemma to Xt = W? to get the SDE 

dX t = dt + 2 W t dW t . 


Example 2.3.4. The canonical SDE in financial math, the geometric Brownian motion, 
JfT = pdt + adWt has solution 

S t = Soe^~^ 2 ) t+aWt 

which is always positive. Again, verify with ltd’s lemma. Also try Ito’s lemma on log(St). 

Example 2.3.5. Suppose dY t = (cj 2 !^ 3 — aY t )dt + aY^dWt- Apply Ito’s lemma to X t = 
— 1 /Y t to get a simpler SDE for X t , 


dXt = —aXtdt + adWt ■ 

Notice that Yt’s SDE doesn’t satisfy the criterion of Theorem 2.3. 1\ but though the change 
of variables we see that Yt is really a function of Xt that is covered by the theorem. 

x For an ordinary differential equation + nXt. = at, the integrating factor is e Ki and the solution is 
X t = Xoe~ Kt + f* a s e~ K, ^ t ~ s ^ds. An equivalent concept applies for stochastic differential equations. 
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2.4 Multivariate Ito Lemma 

Let Wt = ( W(. W 2 ,..., Wfi) be an n-dimensional Brownian motion such that |E WfWj = 
Ijj_j]. Now suppose that we also have a system of SDEs, Xt = (Xf,X 2 ,..., Xj n ) (with m 
possibly not equal to to) such that 

n 

dX\ = CLi{X t )dt + bij(X t )dWl for each i < m 
3 = 1 

where : M m —> M are the drift coefficients, and the diffusion coefficients bij : M m —> M 
are such that b % j = bji and 


^ bu{x)bj^{x)viVj >0 Mv G M m and Vx € M m , 
i,j =1 £=1 

i.e. for each x e M m the covariance matrix is positive definite. For some differentiable 
function, there is a multivariate version of the ltd lemma. 

Lemma 2.4.1. (The Multivariate ltd Lemma). Let f : M + x M m —> M with at 
least one derivative in the first argument and at least two derivatives in the remaining m 
arguments. The differential of f(t,X t ) is 


( £) m a 1 m 

i=i 1 i,j =l 


d 2 


dxidxj 


f(t, X t )dt 


d 


+ EE b ie (X t )—f(t,X t )dWi 


i=i e=i 


(2.3) 


where by(x) = J2?=i hl(x)bjt{x) 


Equation (2.3) is essentially a 2nd-order Taylor expansion like the univariate case of equa¬ 


tion (2.2). Of course, Theorem 2.3.1 still applies to to the system of SDEs (make sure 
a, and bij have linear growth and are Lipschitz continuous for each i and j). and in the 
multidimensional case it is also important whether or not n > m, and if so it is important 
that there is some constant c such that inf x b(x) > c > 0. If there is not such constant 
c > 0, then we are possibly dealing with a system that is degenerate and there could be 
(mathematically) technical problems. 


Correlated Brownian Motion. Sometimes the multivariate case is formulated with a 


correlation structure among the Wfis, in which the Ito lemma of equation (2.3) will have 
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extra correlation terms. Suppose there is correlation matrix, 



( 1 

Pl2 

Pl3 ■■ ■ 

Pin ^ 


P21 

1 

P23 

P2n 

p = 

P31 

P32 

1 ... 

P3n 


\Pnl 

Pn2 

Pn3 ■ ■ ■ 

1 / 

and such that J KW)W( 

= Pij 

for all i, j 

< n. 


d A d 1 ™ d 2 


df(t,X,) = [ ^ +Y,a,(X,)g^- + 5 E E 


2—1 


2j=l^,fc=l 

d 


dxidxj 


f(t,X t )dt 


+EE b ie (X t )—f(t,X t )dW t 


dxi' 

i= 1 £=1 * 

Example 2.4.1. (Bivariate Example) . Consider the case when n = m = 2, with 

dX t = a{X t )dx + b(X t )dW( 
dY t = a(Y t )dt + a(Y t )dW? 

and with ^E W(W( = p. Then, 
df(t,X t ,Y t ) = j t !(t,X,,Y t )dt 


+ +a(X,) i C ) f(t,X t ,Y,)dt+b(X t )Tf(t,X t ,Y t )dW! 

" -v-' 

Xt terms 

+ + «(^)| / ) f ^ Xt ’ yt)dt + a(Y t )^f(t,X t ,Y t )dW t 2 

" -v-' 

It terms 

+ pv(X t )b(X t )X-f(t,X t ,Y t )it 

' -^ 

cross-term. 

2.5 The Feynman-Kac Formula 

If you one could identify the fundamental link between asset pricing and stochastic differ¬ 
ential equations, it would be the Feynman-Kac formula. The Feynman-Kac formula says 
the following: 
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Proposition 2.5.1. (The Feynman-Kac Formula) . Let the function f(x) be bounded, 
let if(x) be twice differentiable with compact supporQ in K C M, and let the function q(x) 
is bounded below for all i£l, and let X t be given by the SDE 


dX t = a(X t )dt + b(X t )dW t 
For t E [0, T\, the Feynman-Kac formula is 


(2.4) 


v(t, x) = E 


f(X a )e~ i( x ^ du ds + e~ I? i( x ^ ds ^(X T ) 


X t =x 


and is a solution to the following partial differential equation (PDE): 


d 

dt 


v(t,x) + 


b 2 * (x) d 


d 


(2.5) 


dx2 + a {x)—j v(t,x) - q(x) v (t,x) + f(t,x) = 0 (2.6) 

v(T,x) = tp(x) ( 2.7) 


• Ifu>(t,x ) is a bounded solution to equations (2.6) and (2.7) for x E K, then u(t,x) = 
v(t, x). 

The Feynman-Kac formula will be instrumental in pricing European derivatives in the 
coming sections. For now it is important to take note of how the SDE in (2.4) relates to 
the formula (2.5) and to the PDE of (2.6) and (2.7). It is also important to conceptualize 
how the Feynman-Kac formula might be extended to the multivariate case. In particular for 
scalar solutions of (2.6) and (2.7) that are expectations of a multivariate process Xf E M m , 
the key thing to realize is that x-derivatives in (2.6) are the same is the df-terms in the 
Ito lemma. Hence, multivariate Feynman-kac can be deduced from the multivariate ltd 
lemma. 


2.6 Girsanov Theorem 

Another important link between asset pricing and stochastic differential equations is the 
Girsanov theorem, which provides a means for defining the equivalent martingale measure. 

For T < oo, consider a Brownian motion (Wt)t<T, and consider another process 8t that 
does not anticipate future outcomes of the W (i.e. given the filtration J 7 )' generated by 
the history of W up to time t, 0 t is adapted to F} v ). The first feature to the Girsanov 
theorem is the Dolean-Dade exponent: 

Z t = exp [~j 0 2 s ds + J 6 s dw)j (2.8) 

2 Compact support of a function means there is a compact subset K such that ip(x) = 0 if x ^ K. For a 

real function of a scaler variable, this means there is a bound M < oo such that ip(x) = 0 if |*| > M. 
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A sufficient condition for the application of Girsanov theorem is the Novikov condition, 


E exp 



< oo . 


Given the Novikov condition, the process Zt is a martingale on [0, T] and a new probability 
measure is defined using the density 


dQ 

dF t 


= Z t 


Vi < T ; 


(2.9) 


in general Z may not be a true martingale but only a local martingale (see Appendix |A| 
and |Harrison and Pliska, 1981]). The Girsanov Theorem is stated as follows: 


Theorem 2.6.1. (Girsanov Theorem). If Zt is a true martingale then the process 
W t = W t - f* 0 s ds is Brownian motion under the measure Q. 


Remark 1. The Novikov condition is only sufficient and not necessary. There are other 

^ 1 rT At n Itrr ' 

sufficient conditions such as the Kazamaki condition, sup rg 7 -Ee 2 Jo Vsaw s < qq w here T 
is the set of finite stopping times. 


Remark 2. It is important to have T < oo. In fact, the Theorem does not hold for infinite 
time. 


Example 2.6.1. (EMM for Asset Prices of Geometric Brownian Motion). The 

important example for finance the (unique) EMM for the geometric Brownian. Let St be 
the price of an asset, 

= hdt + adW t , 

St 

and let r > 0 be the risk-free rate of interest. For the exponential martingale 


Zt = exp I -- 


* l r ~^\ r -ZlXw t 


a 


a 


the process Wjf = ^ff-t + Wt is Q-Brownian motion, and the price process satisfies, 

d ^f = rdt + adW t Q . 

St 


Hence 


S t = S 0 exp ( ( r - -a ) t + aW t 


and St.e rt is a Q-martingale. 
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Chapter 3 

The Black-Scholes Theory 


This section builds a pricing theory around the assumptions of no-arbitrage with perfect 
liquidity and trades occurring in continuous time. The Black-Scholes model is a complete 
market, and there turns out to be a fairly general class of partial differential equations 
(PDEs) that can price many contingent claims. The focus of Black-Scholes theory is often 
on European call and put options, but exotics such as the Asian and the exchange option 
are simple extensions of the basic formulae. The issues with American options are covered 
later in Section lA3l 


3.1 The Black-Scholes Model 


The Black-Scholes model assumes a market consisting of a single risky asset and a risk-free 
bank account. This market is given by the equations 


= licit, + crdWt 
dBt = rBfdt 


geometric Brownian-Motion 
non-stochastic 


where Wt is Brownian motion as described in Chapter [2] and the interpretation of the 
parameters is as follows: 


!i is the expected rate of return in the risk asset, 
a > 0 is the volatility of the risky asset, 
r > 0 is the bank’s rate of interest. 


It turns out that this market is particularly well-suited for pricing options and other vari¬ 
ations, as well as analyzing basic risks associated with the writing of such contracts. Even 
though this market is an oversimplification of real life, it is still remarkable how a such a 
parsimonious model is able to capture so much of the very essence of the risky behavior in 
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the markets. In particular, the parameter a will turn out to be a hugely important factor 
in secondary markets for options, swaps, etc. 

The default focus in these notes will be the European call option with strike K and 
maturity T, that is, a security that pays 

(St — K) + = max (St — K , 0) , 

with strike and contract price agreed upon at some earlier time t < T. In general, the 
price of any European derivative security with payoff ip (St) (i.e. a derivative with payoff 
determined by the terminal value of risky asset) will be a function of the current time and 
the current asset price, 

C(t, St) = price of derivative security. 


The fact that the price can be written a function of t and St irrespective of S 's history is 
due to the fact the model is Markov. Through an arbitrage argument we will arrive at a 
PDE for the pricing function C. Pricing equations for general non-European derivatives 
(such as the Asian option discussed in Section 5.1) are determined on a case-by-case basis. 


3.2 Self-Financing Portfolio 

Let Vt denote the $-value of a portfolio with shares in the risk asset and the rest of it’s 
value in the risk-free bank account. At any time the portfolio can be written as 

Vt = a t S t + Pt 

where at is the number of shares in St (could be any real number) and fit is the $-amount 
in bank. The key characteristic that will be associated with V throughout these notes is 
the following condition: 

Definition 3.2.1. The portfolio Vt is self-financing if 

dV t = a t dS t + rfitdt 


with fit = Vt — at St- 


The self-financing condition is not entirely obvious at first, but it helps to think of one’s 
personal decision-making in a financial market. Usually, one chooses a portfolio allocation 
in stocks and bonds, and then allows a certain amount of change to occur in the market 
before adjusting their allocation. If you don’t remove any cash for consumption and you 
don’t inject any cash for added investment, then your portfolio is self-financing. Indeed, in 
discrete time the self-financing condition is 

VWi = + a t n (St n+ 1 - S tn ) + (e rAt - 1 )fit n 


where At 


t n +i — t n . This becomes the condition described in Definition 3.2.1 as At \ 0. 
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3.3 The Black-Scholes Equation 

For general functions f(t,s), the Ito lemma for the geometric Brownian motion process is 

m,St) = (| /(t, S,)dt + aS,^f(t, S,)dW, 

(recall equation \2.2\ from Chapter [ 2 ]). Hence, applying the Ito lemma to the price function 
C(t, St), the dynamics of the option and the self-financing portfolio are 

dC(t,S t ) = (J^+fiS t ^ + ^^C(t,S t )dt + aS t -^C(t,S t )dW t (3.1) 

dV t = a t dS t + r/3 t dt . (3.2) 

The idea is to find at that can be known to us at time t (given our observed history of 
prices) so that Vt replicates C(t,St) as closely as possible. Setting at = §~ s C(t, S t ) and 
f3t = Vt — at St, then buying the portfolio and shorting C gets a risk-less portfolio 

/ ft \ ( 8 cr 2 S 2 8 2 \ 

d(Vt - C(t, St)) =r(v t - St^C(t, St)) dt - ( - + ) C(t, S t )dt 

and by arbitrage arguments, this must be equal to the risk-free rate, 

= r(V t -C{t,St))dt . 

Hence, we arrive at the Black-Scholes PDE 

8 cr 2 s 2 8 2 8 \ 

m + ^^ + r ‘oi- ; ) c ^ = 0 ^ 

with C(T,s) = ip(s) (recall we denote payoff function for general European claim with 
function i)(s)). 

The power of the Black -Scholes PDE is that it replicates perfectly. Observe: if Vo = 
(7(0, S 0 ), then 

d(Vt-C(t,St)) = 0 Vt<T, 

and so Vt = C(T, St) = i)(St)- In fact, it can be shown that any contingent claim (not just 
Europeans) is replicable under the Black-Scholes model. Hence, the market is complete. 

3.4 Feynman-Kac, the EMM, &; Heat Equations 

Feynman-Kac is a probabilistic formula for solving PDEs like ( |3.3[ ). It also has financial 
meaning because it explicitly provides a unique equivalent martingale measure (EMM). 
Since we have assume no-arbitrage, the 1st Fundamental theorem of asset pricing (see 
Section [l]) necessarily asserts the existence of an EMM. The structure of this probability 
measure is given to us by the Feynman-Kac formula: 
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Proposition 3.4.1. (Feynman-Kac). The solution to the Black-Scholes PDE of (3.3) 
is the expectation 

C(t, s) = e -r(T-t) E Q[^( jS - r )| 5t = 
where E^ is an EMM under which e r ^ T ~ t ' ) St is a martingale, 


dSt = rStdt + crStdW r P , 


with W® = -t + Wt being Brownian motion under the EMM. 

Non-Smooth Payoffs. For call options, the function '0 is not twice differentiable nor does 
it have compact support, so Proposition 3.4. 1| is not a direct application of Feynman-Kac 
as stated in Proposition 2.5.1| of Chapter [2j There needs to be a further massaging of 
PDE to show that the formula holds for this special case. It is quite technical, but the end 
result is that Feynman-Kac applies to most payoffs of financial assets for log-normal models. 


Uniqueness. The uniqueness of the EMM in Proposition 3.4.1 can be argued by using 


the uniqueness of solutions to (3.3). The conclusion that the EMM is unique and that the 
market is complete. 


Relationship with Heat Equation. The Black-Scholes PDE (3.3) is a type of heat 
equation from physics. The basic heat equation is 

d d 2 

-u(t,x) = a 2 -^u(t,x) 

with some initial condition u\t=o = /. Solutions to the heat equation are interpreted as the 
evolution of Brownian motion’s probability distribution. In the same manner that passage 
of time will coincide with the diffusion of heat from a source, the heat equation can describe 
the diffusion of possible trajectories of Brownian motion away from their common starting 
point of Wq = 0. 


Equation (3.3) obviously has some extra term and an ‘s’ in front of the 2nd derivative, 


but a change of variables of t = T — t and x = log(s) leads to a representation of the 
solution as 

C(t, x) = C(T — r, e x ) 

where C solves the Black-Scholes PDE. Doing the calculus we arrive at a more basic PDE 
for C, 


i diT - x) = ip 


2 d 2 ~ f d 

C(t, x) + ( b— -r ) C(t,x ) 


dx 


with initial condition C{ 0,x) = ip(e x ), and with b = 2r 9 cr2 . 
equation with drift bJ^C(r,x), and decay rC{r,x). 


Hence Black-Scholes is a heat 
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3.5 The Black-Scholes Call Option Formula 

Let ip(s) = (s — K) + . From Feynman-Kac we have 

C(t, s ) = e-^-^E Q [(S T - A') + |St = s] 

with dSt = rStdt + aSt.dW®. Through a verification with Ito’s Lemma we can see that 
underlying’s value at time of maturity can be written as a log-normal random variable, 

S T = S t exp - ^<r 2 ^ (T — t) + a(W T - W t )\ . 

And so log(5 r /5t) is in fact normally distributed under the risk-neutral measure, 

log(S r /S t )~Af ((V-V) (T-t),a\T-t)^j . 

Hence, we compute the expectation for ip(s) = (s — K ) + , 

E«{(S T -iF)+|St = s} 


S t 


\j2lXG 2 (T - t) J\og(K / St) 


„x e -\(x-(r-\a 2 )(T-t)f /(<7 2 { T ~t)) fa 


= (t) 


-KQ(\og(S T /S t )>log(K/S t )) 

" -V-' 

=(*) 


where Q is the risk-neutral probability measure, 
(f). First compute (f) (W.L.O.G. for t = 0): 


(t) = 

So r 


So 


x — (r— ^ <t 2 )T 'A ^ 


e^e 2 V ^ 


dx 


a/ 27rcr^T Jiog(K/s 0 ) 

So r 


\j2l\(J 2 T J\og(K/So) 

-2^T (-^ 2 ^ 2 T+x 2 -2x(r-.5a 2 )T+((r-.5a 2 )T) 2 ) d2 


e 2 (t^t 


V2tt(j 2 T J\og(K/s 0 ) 

_ S 0 e rT f°° 
V2na 2 T Ji O g{K/s 0 ) 


(x 2 -2x{r+.5cr 2 )T+((r-.5a 2 )T) 2 ) ^ 


e 2cr z T 


^—(x 2 -(r+.h<j 2 )T)‘ 


dx 
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change of variables v = - ^ , dv = dx/(aVT), so that 

S 0 e rT f°° 


\/27T J(log(K/S 0 )-(r+.5a 2 )T)/(aVT) 


l v 2 , 
e 2 dv 


= Soe rT [ 1 - 


1 


r(log(K/S 0 )-(r+.5a 2 )T)/(aVT) 


a/27T J-c 


e^dv 


= S 0 e rT (l-N(-d 1 )) 

where d\ = lo s( s o/ K )+(r+- 5c7 ) T anc [ _/y(-) is the standard normal CDF. But the normal CDF 
has the property that N(—x) = 1 — N(x), so 

(f) = Soe rT N(d\) . 

(*). Then, computing (*) is much simpler, 

(*) = KQ (log(5 T /5 0 ) > log(K/S 0 )) 

( log(ffr/£o) - (r - .5a 2 )T > \og(K/So) - (r - .5cr 2 )r \ 

V cjVT ~ aVT ) 

^ flog(S T /S 0 ) - (r - . 5cr 2 )T < log(K/S 0 ) - (r - .5 a 2 )T 


= K ti¬ 




er 


Vt 


a 


Vt 


= K (1 - N{-d 2 )) = KN(d 2 ) 


where do = lo &( 5 o/ A )+h — ,,OT ) T = d\ — oVT. Hence, we have the Black-Scholes formula for 
(jy/T 

a European Call Option, 

Proposition 3.5.1. (Black-Scholes Call Option Formula) . The call option on St 
with strike K at time t with price St is given by 

C(t , S t ) = StN(di) - Ke~ r ^N(d 2 ) 
where JV(-) is the standard normal CDF and 

, \og(S t /K) + (r+.ha 2 )(T-t) 

«1 = - - 

aVT^t 

d-2 = d\ — oVT — t . 

A plot of the Black-Scholes call option price with K = 50, r = .02, T = 3/12, and 


a = .2 with varying Sq is shown in Figure 3.1 
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Black Scholes Call 



Figure 3.1: The Black-Scholes call price with K = 50, r = .02, T = 3/12, and a = .2. 
Intrinsic value refers to the payoff if exercised now, (So — K) + . 
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3.6 Put-Call Parity and the Put Option Formula 

It is straight forward to verify the relationship 


(S T - I<) + - (K - S T ) + = St- K . 


(3.4) 


Then applying the risk-neutral expectionan e r ( T to both sides of (3.4) to get the 

put-call parity, 

C(t, St) - P(t, St) = St - Ke- r{ ~ T ~V (3.5) 


where C(t, St) is the price of a European call option and P(t, St) the price of a European 
put option with the same strike. From put-call parity we have 


P(t, s) = C(t , s) + Ke~ r( - T -V - s 


= ~Ke- r ^ T - t \N(d 2 ) - 1) + s(N(d \) - 1) 

= Ke- r{ ~ T - t) N(-d 2 ) - sN(-di) , 
because 1 — N(x) = N(—x) for any iGl. 

Proposition 3.6.1. ( Black-Scholes Put Option Formula) . The put option on St 
with strike K at time t with price St. is given by 


P(t,S t ) = Ke~ r ^N(-d 2 ) - S t N(—di) 


where 


di = 


log(S t /K) + (r + .5o 2 )(T-t) 


a\jT — t 
d- 2 = d\ — a\/T — t . 

The Black-Scholes put option price for K = 50, r = .02, T = 3/12, and o = .2 and 


varying Sq is shown in Figure 3.2 


3.7 Options on Futures and Stocks with Dividends 

This section will explain how to compute European derivative prices on stocks with a con¬ 
tinuously paying dividend rate, and on future prices. There is some technical issues with 
stocks paying dividends at a discrete times, but in the case of European options it is merely 
of matter of considering the future price. Dividends can be an issue in the Black-Scholes 
theory, particularly because they will determine whether or not Black-Scholes applies for 
American options (see Section 5.3). 
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Black Scholes Put 



Figure 3.2: The Black-Scholes put option price for K = 50, r = .02, T = 3/12, and a = .2. 
Intrinsic value refers to the payoff if exercised now, (K — <Sb) + . 
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Continuous Dividends. Suppose that a European option with payoff iP(St) is being 
priced in a market with 

"TT^ = (M - q)dt + adW t 

Jt 

dB t = rB t dt 

where q > 0 is the dividend rate. Self-financing in this case has dynamics 


dVt = a t .dSt + r(Vt - a t S t )dt + qa t S t dt . 

The replicating strategy from the non-dividend case applies to obtain at = £ C{t,S t ), but 
the arbitrage argument leads to a different equation, 


2 - 2 d 2 d 


d cr“s 

dt + ~2~ds 2 


+ (r - q)s— - (r - q) ) C(t, s ) = qC(t, s ) 


(3.6) 


with C(T,s ) = ip(s). Equation (3.6) is the Black-Scholes PDE with dividends, and is 
solved by 

C(t,S t ) = e-^ T -^C\t,S t ) 

where C' 1 (t,S'i) solves the Black-Scholes equation of (|3.3|) with r replaced by r — q. For 


example, a European call option on a stock with dividends: 
Proposition 3.7.1. (Call Option on Stock with Dividends). 

C(t,S t ) = S t e~ q ^ T ~ t ' ) N(di) - Ke~ r( - T - t) N(d 2 ) 

where 

log(S)/ K) + (r — q + .5er 2 )(T — t) 
dl = -- 

d 2 = d\ — oVT — t . 


Futures. Now we turn out attention to futures. Suppose that the spot price on a com¬ 
modity is given by 

7 Q 

= pdt + odW t . 

ht 

From arbitrage arguments we know that the future price on St is 

F t , T = Ste^-V 


with Ftt = St- However, Ste r ^ is a martingale under the EMM, and so Fi t is also a 
martingale with 


dF t) T 

Ft,T 


odW t Q , 
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and the price of a derivative in terms of F t) x is C(t,F tj T) = e r ^ T ^C(t, F^t) where C 
satisfies 

d cr 2 x 2 d 2 \ ~ 

* + = ° < 3 ' 7 > 
with C(T,x ) = For example, the call option on the future: 

Proposition 3.7.2. (Call Option on Future). 

C(t, F tjT ) = e~ r ^ ( F t , T N(<h ) - KN(d 2 )) 

where 

, log(F t! T/K) + .5a 2 (T-t) 
dl - -- 

d.2 = d\ — g\JT — t . 

The Similarities. Futures contracts certainly have fundamental differences from stocks 
paying dividends, and vice versa. But the Black-Scholes PDE for pricing options on futures 
is like that for a stock with dividend rate r. Alternatively, the future on a dividend paying 
stock is 

F t T = 


which is a non-dividend paying asset and can hence be priced using equation (3.7). 

Discrete-Time Dividends. This interpretation of dividend-paying stocks as futures is 
useful when dividends are paid at discrete times. Discrete time dividends are described as 

n(t) 


log(S t /S 0 ) = (r - l(i 2 j t + crW t Q - ^2 Si 


i =o 


where 5i is a proportional dividend rate, and n(t) is the number of dividends paid up to 
time t. The future price of dividend paying stock St is 


F t r = S t exp 


(H>-o- E *) ■ 

V ^ ' i=n(t )+1 J 


Call option on Ftt can be priced with equation (3.7). 
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Chapter 4 

The Black-Scholes Greeks 


The Greeks a set of letters labeling the quantities of risk associate with small changes in 
various inputs and model parameters. The Greeks have importance in risk management 
where hedging and risks are determined by how much/little of the Greeks are exposed on 
a portfolio. There are 5 main Greeks associated with the Black-Scholes, one of which has 
already been instrumental in setting up the hedging portfolio, namely the A. The other 
Greeks can be equally as important in determining the quality of a hedge. 


4.1 The Delta Hedge 


Recall the derivation of the Black-Scholes PDE of equation (3.3) on page 25 In particular, 


recall the hedging allocation in the underlying was the first derivative. This is the Black- 
Scholes A, 


A(t,s) = ^- s C{t,s ) . 

In general, the hedge in the underlying what’s referred to when someone talks about being 
long, short or neutral in A. With continuos trading the hedging portfolio is a perfect 
replication because it continuously rebalances to remain A-neutral. A long position in A 
would be when the hedging portfolio holds the underlying in excess of the A (e.g. a covered 
call), and a short position in A would be a hedge with less (e.g. a naked call). 

For the European call and put options, the Black-Scholes A is 


A caii{t,s) = N(di) 

A P ut(t,s) = -N(-di) = A ca u(t,s) - 1 


which are obtained by differentiating the formulas in Propositions 3.5.1 and 3.6.1, respec¬ 
tively. Plots of the A’s for varying s are shown in Figure [4~Tj Notice that all the action in 
the A-hedge occurs when the underlying price is near the strike price. Intuitively, a call 
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or put that is far money is efficiently hedged by either holding a share of the underling 
or simply holding the cash, and the risk is low because there very little probability of the 
underlying changing dramatically enough to effect your position. Hence, the A-hedge is 
like a covered call/put when the derivative is far from the money. 


Black Scholes Call A 



Black Scholes Put A 



Figure 4.1: Top: The Black-Scholes A for a European option with strike K = 50, time to 
maturity T = 3/12, interest rate r = .02, and volatility a = .2. Bottom: The put option 
A. 

4.2 The Theta 

The next Greek to discuss is the 0, which measures the sensitivity to time. In particular, 

The financial interpretation is that 0 measures the decay of the time value of the security. 
Intrinsic value of a security is fairly clear to understand: it is the value of the derivative at 
today’s price of the underlying. However, derivatives have an element of time value on top 
of the intrinsic value, and in cases where the intrinsic value is zero (e.g. an out-of-the-money 
option) the derivative’s entire cost is its time value. 
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For option contracts, time value quantifies the cost to be paid for having the option to 
buy/sell, as opposed to taking a position that is long/short the underlying with no option 
involved. In most cases the time value is positive and decreases with time, hence 0 < 0, 
but there are contracts (such as far in-the-money put options) that have positive 0. The 
0’s for European calls and puts are 


Q C aii(t,s) = - 2 ^LN'(d 1 ) - Ke~ r ^N(d 2 ) 

(4.1) 

@ P ut(t,s)- n (J !fLiV'(di) + A-e- r ( T - t )iV( d 2 ) 

1 — t 

(4.2) 

and are shown in Figure |~i~2| To understand why a far in-the-money put option has positive 
0, look at Figure |4.3| and realize that the option price must with rise with time if it is 


below the intrinsic value. 


Black Scholes Call 0 



Black Scholes Put 0 



Figure 4.2: Top: The Black-Scholes 0 for a European option with strike K = 50, time to 
maturity T = 3/12, interest rate r = .02, and volatility a = .2. Bottom: The put option 


0 . 


In terms of hedging, a short position in a derivative is said to be short in 0, and a long 
position in the derivative is also long the 0. Hence, a A-neutral hedge will exposed to 0, 
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Black Scholes Put 



Figure 4.3: The Black-Scholes price of a European put option with strike K = 50, time to 
maturity T = 5, interest rate r = .02, and volatility a = .2. Notice if the price is low then 
the time value of exercise today is higher than the value of the option. This means that 0 
of far in-the-money put options is positive. 
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and a decrease in 0 will benefit the short position because it means that the time value of 
the derivative decreases at a faster rate. 

4.3 The Gamma 

The T of a security is the sensitivity of its A-hedge to changes in the underlying, and there 
a handful of ways in which the Y explains hedging and risk management. In a nutshell, 
the T is the higher order sensitivity of the hedging portfolio, and is essentially a convexity 
term for higher order hedging. 

Let At be the amount of time that goes by in between hedge adjustments. From Ito’s 
lemma we have the following approximations of the derivative dynamics and a self-financing 
portfolio: 


/ 2 o2 \ 

ACt ~ f 0* + —2^-r*) At + A f AS 

AV t ~ A t • AS t + r (V t - A t • S t ) At 


where ACt = C*t+At — Ct and A St = St+At. ~ St, and where 0*, A f and F/ ; are the 
derivative’s Greeks. Subtracting one from the other gets the (approximate) dynamics of a 
long position in the portfolio and short the derivative, 


AVt - AC t — r (V t - A t • S t ) At + 

" -V-' 

earned in bank 

premium over risk-free 

and so while the A-hedge replicates perfectly in continuous time, there will be error if 

a 2 S 2 

this hedge is readjusted in discrete time. However, —Q t -> 0 corresponds to the 

premium over the risk-free rate earned by the hedging portfolio. Often times T > 0 and 
0 < 0, and so the risk taken by rebalancing a A-hedge in discrete time is compensated 
with lower (more negative) 0 and lower T. 

For European call and put options, the Y is 


-0t- 


a 


2 S? 


t Yt ) At 


(4.3) 


Y(t,s) 


N'(ch) 

StaVT^t 


(4.4) 


which proportional to the probability density function of log (St/S t) and is always positive 
for long positions. Furthermore, combining the F in (|4.4|) with equation (4.1) we find the 


premium of equation (4.3) is 


- @caii(t,s ) - ^P-Y{t,s) = Ke-< T -^N(d 2 ) > 0 , 
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and so the A-hedge for a European call option earns a premium over the risk-free rate. 
However, the A-hedge of the put option has a negative premium 

-& P ut(t,s) - = -Ke~ r ^N{-d 2 ) < 0 , 

which means the portfolio that longs the put option and shorts the A-hedge will earn a 
return over the risk-free rate. 

If A-hedging is too risky then the T is used in higher order hedging, namely the A-r 
hedge. The idea is to consider a second derivative security C\t, s ) with which to hedge. A 
hedge that is both A-neutral and T-neutral is found by solving the equations 


C{t,S t ) = atSt + PtC'(t,St) + vt 
A(t,St) = a t + PtA'(t,S t ) 


where at is the number of contracts in the underlying, (3t is number of contracts in C' , and 
r]t is the $-amount held in the risk-free bank account. This hedge is more expensive and 
will earn less premium over the risk-free rate, 


AV t 





A t -{- 




At , 


with — 0j being the premium over the risk-free rate. A plot of the Black-Scholes T is 
shown in Figure [4~4| 


4.4 The Vega 


The Vega is the Greek to describe sensitivity to a. Of the 5 main Greeks it is the only 
one that corresponds to a latent parameter (although some could argue that the risk-free 
rate is also somewhat latent or abstract). Hence, not only is it important to have a good 
estimate of cr, it is also important to know if your hedging position is sensitive to estimation 
error. Indeed, volatility is a major question inn derivative pricing, and will be addressed 
in complete detail in Section [6j 

For the Black-Scholes call and puts, the Vega is 

^-C = StN\(h)VT^t 


which is proportional to the T and to the probability density function of log (St/S t) (see 
Figure 4.4). There are some portfolios that are particularly sensitive Vega, for instance a 
straddle consisting of a call and put with the same strike K if K is near the money. 
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Black Scholes r 



Black Scholes Vega 



Figure 4.4: Top: The Black-Scholes T = proportional to the a Gaussian proba¬ 

bility density function of log (St/S t). 
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4.5 The Rho 


The p is the sensitivity of to changes in the risk-free rate 




Of the 5 main Greeks it widely considered to be the least sensitive. For the Black-Scholes 
call and put options, the p is 


p call = K(T-t)e~ r ^N(d 2 ) 

Pput = —K(T - t)e~ r ^N(-d 2 ) . 

Notice that p ca u > 0 because higher interest rates means risky assets should rise in price 
and hence it will be more likely that you will exercise. Similarly, p pu t < 0 because higher 
interest rates means it is less likely that you will exercise. 


4.6 Other Greeks 

There are also other Greeks of interest in derivatives. Some of the are 

• A, t|Cx^, elasticity, or measure of leverage. 

• Vanna, The sensitivity of the A to changes in volatility. 

• Volga, ^jC 1 , used for more elaborate volatility hedging. 
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Chapter 5 

Exotic Options 


This section describes some of the better-known facts regarding a few exotic derivatives. 
The American and the Asian options are considered ‘path-dependent’ because the payoff 
of the option will vary depending on the history of the price process, whereas exchange 
options and binary options are not path-dependent because the payoff only depends on the 
asset price(s) are the terminal time. 


5.1 Asian Options 

Asian option consider the average price of the underlying in the payoff, 

These options are path dependent because the path taken by the underlying is considered. 
The averaging in the Asian option makes it harder to manipulate the payoff by driving the 
underlying’s price up or down when the option is close to maturity. 


Delta Hedge. In the Black-Scholes framework, the Asian option is a relatively simple 
extension of the hedging argument used to derive the Black-Scholes PDE. The trick is to 
consider an use an integrated field, 



and consider the price 


C(t, s, z) = price of Asian option given St = s and Z t = z. 


Let the underlying’s dynamics be 


dS t 

St 


pdt + adWt 
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where Wt is a Brownian motion. From Ito’s lemma we have 


dC(t,S„Z t ) = (| + C(t,S t ,Z t )dt + vS t £-C(t,S l ,Z,)<Wt ■ 

On the other hand, we have a self-financing portfolio, 

dV t = a t dS t + r(V t - a t S t )dt 

where at is the number of contracts in St and (V) — atSt) is the $-amount in the risk-free 
asset. Comparing the Ito lemma of the option to the self-financing portfolio, we see that a 
portfolio that is long Vt and short the Asian option will be risk-less if we have the A-hedge 

at = ^C(t,S t ,Z t ) • 

Furthermore, since the portfolio long Vt and short the option is risk-less, it must be that 
it earns at the risk-free rate. Hence, 

( r\ \ / r\ 2 C*2 02 Cj O \ 

^ - S t -C(t, S t , Z t )j + + C{t ’ Su Zt)dt 


= r(V t — C(t, S t , Z t ))dt . 

Hence, the price of the Asian option satisfies the following PDE, 


d a 2 s 2 d 2 d s d . . . 

Wt + ^2Td? + S ds + Td~z~ r ^ C{t ' S ' z) ~ 0 


C(t,s,z) 


t=T 


= (z-K)- 


Equations (5.1) and (5.2) can be solved with a Feynman-Kac formula, 

C(t, s, z ) = e _r ^E° \{I T - K) + S t = s,Z t = z 


(5.1) 

(5.2) 


where is the expected value under the unique EMM Q, and the asset price is ^ = 
rdt + adW® under Q. We knew from Chapter [H] that this market was complete, so it 
shouldn’t come as a surprise that we were able to hedge perfectly the Asian option. 


PDE of Reduced Dimension. A useful formula is the dimension reduction of the Asian 
option price to a function of just time and one stochastic held. Define a new function 


x ) = E^ 



S t = 1 
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Then let J~t denote the history of prices up time time t, and define the martingale 


M t = E Q 



S-u d/a — K 


Tt 


= E Q 




Tt 


= S t E q 



K ~ T fo S u du 


S t 


Tt 


= S t <f>{t,X t ) 

K-— f* S du 

where Xt = -2^—-—. Applying the Ito lemma to we have 

dX t = —^dt + X t (^—adW® — rdt + a 2 dt^j , 
and assuming that 4>(t,x) is twice differentiable we apply the Ito lemma to (j>{t,Xt): 

W,X<) =(! + (-? + X ‘ X-r)) §~ x + X X^) •Kt,X,)it-X t ^(t,X t )dW? . 

Then applying Ito’s lemma to Mt, we have 

dM t = (Pit, X t )dS t + S t d(p(t, X t ) + dS t • #(t, X t ) 

= (j>(t,X t ) (rS t dt + aStdW t Q )+S t (j^ + (~^+X t (a 2 - r)) ^ +X 2 t a 2 ^j </>(t,X t )dt 


r\ r\ 

-S t X t a—(j>(t,Xt)dW t Q - X t S t a 2 —<p(t, X t )dt , 

and in order for Mt to be a martingale the dt terms must all cancel out. Therefore, we 
must have 


d 


1 


d 


d 2 


rW,X t ) + ( Wt - ( - +rX t ) — ) 4>(t, Xt) = 0 , 


dx 


dx 2 


which gives us the PDE of reduced dimension for a new function <j{t,x) = e rlyT f){t,x ), 


d 


dt 


d 


XI- X + rx + U(M) = 0 


d 2 


dx 


dx 2 
4>(t,x) 


t=T 


= max(— x,0) . 


(5.3) 

(5.4) 
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The PDE of equations (5.3) and (5.4) is of reduced dimension and is solvable with a 
Feynman-Kac formula, 

4>(t,x) = E[max(— Xt, 0)|Xj = x\ 

with dX t = — ( i + rX t ) dt + aX t dWt . Finally, the option price is 


C(t,S t ,Z t ) = Sf<l>[t 


K - Z t 

S t 


Vt < T . 


(5.5) 


Equation (5.5) is the pricing formula of reduced dimension. 


5.2 Exchange Options (Margrabe Formula) 

Consider two tradable assets, Sj and Sf. An exchange option with strike K has payoff 

V’OSr.SfO = (S^-S^-K) + 


which we price price under an EMM. In particular, for K = 0 we have 
C{t, Sl ,s 2 ) = e- r( ~ T -^E Q [(S^-SfO+ISt 1 = si,S? = s 2 

'S' 


= e _r(T_t) E Q 


S T ( ^ ~ 1 

kJrjn 


Sl = 8 1 ,$ = S2 


(5.6) 


Options like this are found in forex markets and in commodities, the latter of which uses 
them to hedge risk in the risk between the retail price of a finished good verses the cost 
of manufacture. In electricity markets, the price of a megawatt hour of electricity needs 
to be hedged against the price of natural gas in what’s called the spark spread; in the soy 
market the cost of soy meal verses the cost of soy beans is called the crush spread; in the 
oil market the cost of refined oil products verses the cost of crude oil is called the crack 
spread. In commodities markets these options are called sometimes called spread options. 
The Margrabe formula prices spread/exchange options with K = 0. 

Suppose we have a double Black-Scholes model for the two assets, 


dS[ 

si 

dS! 


rdt T ci ^ l — p 2 dWl + pdWf 


- = rdt + a 2 dW t 
St 

where W t 1 and Wj 2 are independent risk-neutral Brownian motions and p G (—1,1). Now 


we define ratio in (5.6) as 


Y -Sl 

‘ Sf 
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define an exponential martingale 


and notice that 


Z t = exp ( --erf t + a 2 W 2 


C(t, ai, s 2 ) = s 2 eQ[Z t (Y t - 1 )+\Sl = Si, S 2 = s 2 \ . 

Recall the Girsanov Theorem from Chapter [2] and recognize that Zt defines a new measure 
P'5 under which Wt = W 2 — a 2 t is Brownian motion, and under which remains a 
Brownian motion independent of Wt■ Applying the Ito lemma to Yt we get 

dY t = Y t (a 2 2 - a 2 (Jip)dt + Y t ai (y/l - p 2 dW t l + pdW ?) - Y t a 2 dW 2 


= Y t ai (yi -p 2 dW} + pdW t ) - Y t a 2 dW t , 


and we can now write the option price under the new measure to get the Margrabe 
Formula: 


C(t, Si, s 2 ) = s 2 E q 


(Yt - 1 )- 


Y 


si/s 2 


= s 2 


^Si e -^(af+al- 2 a 2 aip)(T-t)+cT 1 ^l-p 2 (Wj.-W t 2 )+p(WT-Wt)')-a2(WT-Wt) _ ^ 


Y t = si/s 2 


= SiN(di) - s 2 N(d 2 ) 

where N(-) is the standard normal CDF function and 


di 


d 2 


\og(si/s 2 ) + \o 2 (T - t ) 
cry/T — t 
di — a\jT - t , 


where a 2 = af + a\ — c la 2 aip. Essentially, the Margrabe formula is the Black-Scholes call 
option price with r = 0, Sq = si, K = s 2 , and a 2 = a 2 + a 2 — 2a 2 aip. 


5.3 American Options and Optimality of Early Exercise 

Broadly speaking, options on index funds (such as S&P 500) are European, whereas options 
on individual stocks (such as Apple) are American. American options have the same 
features as their European counterparts, but with the additional option of early exercise. 
The option of early exercise can be equated with the option to wait and see, as the prices 
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of these options is usually thought of the present value of the payoff when exercised at an 
optimal time. For this reason the American options are consider path-dependent. Methods 
for pricing American options are quite involved, as there are no explicit solutions. Instead, 
pricing is done by working backward from the terminal condition on a binomial tree, or by 
solving a PDE with a free boundary. Pricing methods are not covered here; these notes 
focus on some fundamental facts about American options. 

The first thing to notice is that an American option is worth at least as much as a 
European. Hence, the prices are such that 

American Option = European Option + value of right to early exercise 


> European Option. 

In particular, we will that an American put has positive value for the right to early exercise, 
and so does the American call when the underlying asset pays dividend. 

American Call Option. Let 6'/' denote the price of an American call option with strike 
K and maturity T. At any time t < T we obviously must have C.f > St — K, otherwise 
there is an arbitrage opportunity by buying the option and exercising immediately for a 
risk-less profit. Therefore, we can write 

Cf = max (S t - K, Cf ont ) 

where Cf° nt is the continuation value of the option at time t\ at time t < T the price is 
determined by whether or not it is optimal to exercise. Letting C'f denote the price of the 
European call, it is clear that the long party cannot lose anything by having the option to 
exercise early (provided they do so at a good time) and so 

Cf = max (S t - K, Cf ont ) > Cf . 

Proposition 5.3.1. For an American call option on a non-dividend, paying asset, it is 
never optimal to exercise early. Hence, American and European calls have the same price 
and 

(jA (jcont Q E 

Proof. The proof follows from the European Put-Call parity and is simple. Early exercise 
is clearly not optimal if St < K. Now suppose that St > K. Letting P E denote the price 
of a European put option, we have 

Cf = max (St — K, Cf ont ) > Cf = Pf -\-St~Ke~ r ^ T ~ t ' > > St—K = value of early exercise, 

which proves the continuation value is worth more than early exercise, and the result 
follows. □ 


47 



To illustrate this proposition consider the following scenario: Suppose you’re long an 
ITM American call, and you have a strong feeling that the asset price will go down and 
leave the option OTM. In this case you should not exercise but should short the asset and 
use the call to cover your short position. For example, suppose the asset is trading at $105, 
you’re long a 3 month American call with K = $100, and the risk-free rate is 5%. Now 
compare the following two strategies: 

1. Exercise the option and invest the proceeds: 

• Pay K = $100; sell the asset at $105. 

• Invest proceeds of $5 in bond. 

• In three months have $5.06. 

2. Short the asset, hold the option, and invest the proceeds: 

• Short the asset at $105. 

• Invest the proceeds of $105 in bond. 

• In three months have $106.32 in the bank; owe no more than $100 to cover your 
short position. 

From these two strategies it is clear that early exercise is sub-optimal, as the latter strategy 
has at least an edge of $1.32. Alternatively, one could simply sell the call, which will also 
be better than exercising early. 

The introduction of dividends changes things. In particular, early exercise can be 
optimal if the present value of the dividends is worth more than the interest earned on the 
strike and the time value of the option. We see this from the put-call parity 

Cf > Cf = Pf + S t - Ke-^-^ - PVt(D) 

where PVt(D) is the present value of dividends received during the life of the option. 
For European options, dividends can be expressed in terms of a yield, 5 > 0, so that 
PV t (D) = S t ( 1 - e-^-V) and 

Cf > Cf = Pf + 5 t e" <5(T - t) - A"e" r(T - t) . 

It may be optimal to exercise early if Cf < St — K, which is possible in the presence of 
dividends as we can see in Figure |5.l[ The intuition for this figure is that Pf -> 0 as 
St —> oo, and that Cf ~ Ste~ S( ' T ~ ) — Ke~ r ^ T ~ t ' ) < St — K for St big enough. This only 
proves that the option for early exercise has positive value, therefore implying that early 
exercise may be optimal at some point; it does not mean that one should exercise their 
American option right now. 
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Figure 5.1: The Black-Scholes price of a European call option on a dividend paying asset, 
with K = 50, T — t = 3/12, r = .02, a = .2, and 5 = .05. Notice that early exercise exceeds 
the European price for high-enough price in the underlying. 
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Proposition 5.3.2. For an American call option on a dividend-paying asset, it may be 
optimal to exercise early. 

Proof. Let 6 > 0 be the dividend rate and assume r > 0. From the put-call parity, 


Cf > Cf = Pf + Ste-^-V - Ke-^-V = Pf + ( e -^- r ^ T - t) S t - A") . 

Now suppose that 5, r , K and S are such that 


Cf < S t -K . 


Clearly, Cf > St — K, and so in this case we have strict inequality Cf > Cf, which 
indicates the early exercise of the American option adds value, and hence early exercise 
can be optimal if done at the correct time. □ 


In practice, dividends are paid at discrete times, and early exercise of an American op¬ 
tions should only occur immediately prior to the time of dividend payment. The intuition 
is similar to the non-dividend case: at non-dividend times it is optimal to wait; time value 
of strike amount in bank and the right to exercise are more valuable. 


American Put Option. Regardless of dividends, early exercise of an American put 
option may be optimal. To understand why, simply consider a portfolio consisting of an 
American put with strike K > 0 and a single contract in the underlying. If the underlying 
hits zero, then it is certainly optimal to exercise. In general, early exercise of a put option 
will be optimal if the price drops low enough to where the time value of the strike amount 
in the bank account is worth more than the option to wait on selling the asset plus the 
present value of any dividends that might be recieved. 


Proposition 5.3.3. Early exercise of an American put option may be optimal if r > 0. 
Proof. By contradiction. Suppose it is never optimal to exercise an American put. Then 

P t A = max (K - S t , Pf ont ) = Pf° nt = Pf 

where Pf ont is the continuation value of the option and Pf is the price of a European put. 
Recall Figure 43 on page 37 of Section [4j where we see that 


Pf 


e -r(T-t) K as St ^ Q 


If r > 0 there is a price S ° > 0 such that Pf < K — St for all St < Sf which is a 
contradiction. Hence, early exercise can be optimal. □ 


Proposition 5.3.3 is valid regardless of whether or not there are dividends, but the time 
value of dividends will counterbalance the time value of the cash received from exercise. If 
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r = 0, then the European put option is always greater or equal to its intrinsic value, which 
means 

K - S t < P t E < P t A = max (K - S t , P t cont ) , 

with strict inequality between K — St and P E if St > 0. Hence, if P (St > 0) = 1, then 
P (P A = P£ ont ^) = 1 and early exercise is never optimal. Joke: Therefore, the Fed should 
drop interest rates if they are worried about rampant sell-offs by option holders. 

To summarize, American options have three components beyond their intrinsic value, 

• value of option to early exercise (always positive impact on price), 

• time value of money (positive for calls, negative for puts), 

• dividends (negative for calls, positive for puts). 

The value of an American option over the price of a European option is essentially the 
value of early exercise. While the above results are model-free, the price (and the optimal 
exercise boundary) will be model-dependent. 

5.4 Bermuda Options 

A Bermuda call option has two exercise times T\ and T 2 with 

0 < Ti < T 2 < 00 . 

At time T\ the holder of the option has the right to exercise a call with strike K\ , or to 
continue with the possibility to exercise at time T 2 with exercise /i 2 . Hence, the risk-neutral 
price is 


C B (t ) = 


e -r(Ti-i)]gQ 

C E (t ) 


max 


(S Tl -K u C E (TO) 


Tt 


for t < T\ . 
for T\ < t < T 2 


where C E (t) is price a European call option with strike A 2 and maturity T 2 . 
Consider the Black-Scholes model, 


dSt 

St 


= (r — q)dt + crdW® , 


where q is the dividend rate. Recalling the Black-Scholes price for a call on a dividend¬ 
paying asset (see Section 3.7 of Chapter [ 3 ]), the time-Ti payoff for the Bermuda call option 
is 

V’(s) = max — Ki, se~ q ^ T2 ~ Tl ' 1 N(d\) — A'e _r ^ T_< ' ) A'((i 2 )^ , 
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where 


, _ log (s/K 2 ) + (r-g+ .5 a 2 )(T 2 - T x ) 
Cl ay/W^Ti 

d>2 = d\ — a\JT 2 T\ . 


It follows that for t <T\ that the Bermuda option price C B (t, s ) is given by the PDE 


( d a 2 s 2 d 2 

\m + ~Td^ + {r 



rj C B (t, s) = 0 
C b (T 1 ,s) = ^(s) . 


When pricing the Bermuda option is intuitive to think about the appropriate early-exercise 
rule. We learned from Section 5.3 that it is never optimal to exercise at time T\ if q = 0. 
However if q > 0 then we need to think of a rule for when to exercise. The rule turns out 
to be simple: find the optimal value s e such that at time t = T\, 


exercise if SV, > s e , 
do not if St x < s e . 

In the next subsection we consider the American put, where there will be a time-dependent 
exercise rule. 


5.5 Free-Boundary Problem for the American Put 

Consider the Black-Scholes model, 


■ = rdt + adW® , 

with r > 0. In this case it may be optimal exercise early an American put. The price P A 
of the put is the solution to an optimization problem, 


P' 4 (f, s) = supE^ 

T>t 


e _ r( T A r-t)( K _ SrAT y 


St = s 


where the r is the optional exercise time chosen by the holder, and T A r = min(T, r). 
Define £ to be the PDE operator, 


d a 2 s 2 d 2 
dt 2 ds 2 


d_ 

ds 
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If at time t it is optimal to continue then CP A (t, St) = 0; if it is optimal to exercise then 
£P A (t,St) = C(K — St) < 0 and ^P A (t, St) = — 1. This is equivalent to finding the pair 
(. P A , s e ) such that 

CP A (t , s) = 0 for t < T and s > s e (t) , 

P a (T,s) = (I< -s)+ 

P A (t, s e (t)) = —1 for t < T . 
os 

This is the so-called free boundary problem , which refers to the fact that we need to solve 
for the boundary s e (t) and the PDE’s solution P A (t,s). A straight-forward way to solve 
this PDE and to see the boundary is with a backward recursion on a binomial tree. 
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Chapter 6 

Implied Volatility &; 
Local-Volatility Fits 


A major point of discuss with Black-Scholes pricing is the role of volatility. The questions 
comes about when the Black-Scholes formula is inverted on the market’s option prices, 
which produces an interesting phenomenon known as the implied volatility smile, smirk, or 
skew. The implied volatility suggests that asset prices are more complex than geometric 
Brownian motion, and the Black-Scholes’ parameter a needs to be dynamic. Local volatility 
models and stochastic volatility models are two well-known ways to address this issue; 
jump-diffusion models and time-change jump models are also a popular choice of model. 

6.1 Implied Volatility 

Implied volatility is the parameter estimate obtained by inverting the Black-Scholes model 
on market data. The interesting part is that European call and put options will return a 
different parameter for different strikes and maturity. Let C BS (t,s,k,T,a ) be the Black- 
Scholes price of a call option with strike K and maturity T, and let Cf ata (K,T) be the 
price in the market. 

Definition 6.1.1. Implied Volatility is the quantity a(K,T) > 0 such that 

C? ata {K , T) = C BS (t, S t , K, T , a(K, T)) . 

Traders often quote option prices in implied volatility, probably because it is a ‘unit-less’ 
measure of risk. Typically, the implied smile/skew is plotted as a function of the option’s 
strike as shown in Figure [6bfl When plotting implied volatility, the term ‘Moneyness’ refers 
to how far the option is in-the-money (ITM) or out-of-the-money (OTM). Moneyness can 
also be interpreted as a unit-less measure, for instance the log-money ness, 

log-moneyess = log [Ke~ r{ ' T ~ B> /St) ■ 
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Implied volatility is skewed to the left for many assets, but some assets (such as forex 
contracts) have a right skew (see Figure [ih2) ). The interpretation is the followoing: implied 
volatility smile/skew suggests that Black-Scholes is an oversimplification, and the left skew 
of equity options (such as the SPX ETF options) suggests there is ‘crash-o-phobia’ for long 
positions in the underlying, and long put options are an insurance item for which there is 
a premium in the market. 


Implied Volatility Skew 
1 year Vs 3 Month 



- SPX 1Y Skew (5/20/2010 Close) - SPX 1Y Skew (9/8/2010 Close) 


Figure 6.1: Implied Volatility Skew of SPX Index Options. On May 20th, 2010, the S&P 
500 closed at 1072, so moneyness refers to something close to the intrinsic value. 
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The volatility smile of the EURJPY 



10 D 15 D 20 D 25 D 35 D atm 35 D 25 D 20 D 15 D 10 D 
Call Call Call Call Call Put Put Put Put Put 

Delta of the Option 


Figure 6.2: The implied volatility smile for options on the Euro/Yen currency exchange. 
Notice the skew is to the right. 
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Bounds on Implied Volatility. There are some bounds on implied volatility that can 
obtained from the Black-Scholes formula. 


Proposition 6.1.1. Changes in implied volatility have bounding inequalities 

e ~r(T-t) N{ _ d2) ^ ^ < e~r^N{d 2 ) 

StVT^tN'{di) ~ dK a{ ’ “ StVT^tN'(di) 

where d\ and d 2 are given by the Black-Scholes formula and are functions of a. 


The derivation of Proposition 6.1.1 is as follows: 
For the call option, 

d 


_ C data {K ) = c B Sl (t> 5t> R ^ T)) 


d 


hence 


+C? b (t, S t , K, T, a(K, T))—a(K, T) < 0 . 

d_. (K T \ K Cf? Sl (t, St, K, T, d(K, T)) 
dK a[ ’ Cff s {t, S t , K, T, d(K, T)) 

e - r (T-i)N(d 2 ) 

~ StVT^tN’fo) ’ 

where it is straight forward to verify the A'-derivative to be 

d 


c ° Sl = ( S t N[d x ) - Ke- r{ ?-^N(d 2 j) = -N(d 2 ) 


( 6 . 1 ) 


The ratio in (6.1) has the Greek letter vega in the denominator, which causes it to blow-up 


when K is far-from-the-money, hence the estimate will be more useful for neat-the-money 


options (see Figure 6.3). A similar inequality is derived from the put option, 


Sv/T^tN'id,) ~ OK ’ 


This growth bound is also shown in Figure [673 


Another important bound on implied volatility is the moment formula for options with 
extreme strike. Let x(K ) = log {Ke~ r ^ T n / Sf)- There exists x* > 0 such that 


d(K, T) < 


T-t 


\x(K) | 


( 6 . 2 ) 
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Bound on Implied Volatility Growth 



Figure 6.3: The growth bound on an option with underlying price St = 50, time to maturity 
T — t = 3/13, interest rate r = 0, and implied volatility a = .2. For an ATM option, the 
bounds of proposition 6.1.1 show that derivative of implied volatility with respect to strike 
to be bounded by about 5%. 
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for all K s.t. \x(K )| > x*. The power of this estimate is that it provides a model-free bound 
on implied volatility. In other words, it extrapolates the implied volatility smile without 
assuming any particular structure of the probability distribution. The upper bound in (6.2) 
can be sharpened to the point where there is uncovered a relationship between implied 
volatility and the number of finite moments of the underlying St- 

Proposition 6.1.2. (The Moment Formula for Large Strike). Let 

p = sup{p : < 00 } , 


and let 


Then /3 r G [ 0 , 2 ] and 


where 1/0 = 00 . Equivalently, 


a ,. a 2 (K,T) 

PR = llm sup , • 

K^oo \x(K)\/T 


- _ 1 Pr 1 
P ~2p R + 8 2 


Pr = 2 - 4 (vV +P - P ) 


where the right-hand side is taken to be zero when p = 00 . 

There is also a moment formula for extremely small strikes and q = sup{(/ : E ^Sf q < 
00 }. Both moment formulas are powerful because they are model-free, and because they 
say that tail behavior of the underlying can be characterized by implied volatility’s growth 
rate against log-moneyness. Indeed, an asset with heavy tails will have higher implied 
volatility, and so going long or short an option will have more risk, and hence these options 
will cost more. The important details of the moment formula(s) are in the original paper 
of Roger Lee |Lee, 2004] . 


6.2 Local Volatility Function 


The idea the volatility is non-constant was first implemented in local volatility models. A 
local volatility model is a simple extension of Black-Scholes 

7 O 

= pdt + a(t, S t )dWt 
Pt 

where the function a(t, s ) is chosen to fit the implied volatility smile. Dependence of a on 
St does not effect completeness of market, so there is a pricing PDE that is try similar to 
the Black-Scholes and is derived in much the same way: 


d s?o 2 (t,s t ) d 2 

dt 2 ds 2 



C(t, St) = 0 


(6.3) 
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with terminal condition C \ t _ T = (s — /i) + . Hence, the Feynmann-Kac formula applies and 
we have 


C(t, s ) = e- r(T - t) E Q [(5 T - K) + \S t = s} 


where E® is a risk-neutral measure under which ^ = rdt + a(t. St)dWf. Now, observe 
the following facts: 


1. The first derivative with respect to K is written with a CDF, 




= e -r(T-t)_d_ 


OK 


K^[(S T -K) + \S t = s] 


= e _r(T_t) E Q 
= -e~ r( - T ~ t] E Q 


E(St-A T 


St = 8 


[S t >K} 


S t = s 


= -e~ r ^Q(S T > K\S t = s ) 
= n - 


(l-Q(S T <K\S t = s)) 


2. The second derivative with respect to K is a PDF, 

02 02 

9 K 2 C &*) = e ' r(T_t) ^ 2 eQ [(^ - I<) + \S t = s] 

= - e ' r(T_t) ^ (! " ^ = s)) 

= e~ r{T - t ^M s T< K \ s t = s) , 

which is commonly referred to as the Breeden and Litzenberger formula. 

3. Finally, denote by C(t, s, k ) all the list call options with strike T and strikes k ranging 
from zero to infinity. For a fixed K > 0, differentiating with respect to T and repeated 
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application of integration-by-parts yields the following 


§fC(t,s,K) 

_d_ 

~dT 


(e- r{ ?-W[{S T - K) + \S t = a]) 


= - re-^-^KSr - K)+\S t = s] + Q [(S T - K)+\S t = s} 

= - re-^ T -^E Q {(S T - K) + \S t = s} 


+ e-'< T -‘)|,E« 


(• - K)+ + l (rS.lp.jK, + a2( "' 2 S ") S " fe(S„ ) ) in 


S, = s 


a(u, S u )S u l [Su >K]dW.$ 


St = 8 


=0 


= - re-^-^UST - K)+\St = s ] 

+ e _r(T_t) IE Q 


, ° 2 (T,St)S 2 T ; fcl , 
rS T l[s T >K} H-~- Ok {St) 


St = 8 


=rKe~ r{ - T ~ t) E q 


■ * 2 (t,s t )s 2 

1 [St>^] ^-o- 0 K (b T ) 


St = 8 


<9 2 a 2 (T,k)k 2 , . 

=rK j l [k > K] -^C(t,s,k)dk + J --- 6 K (k)-^C(t,s,k)dk 


d _ a 2 (T,K)K 2 <9 2 


= ~rK—C(t,s,K) + 


dK 2 


C(t, s, IC) . 


The above steps 1 through 3 have derived what is known in financial literature as Dupire’s 
Equation: 



K 2 a 2 (T, K) d 2 
2 ~dK' 2 


+ rK 



C{t, s, K) =0 


(6.4) 


with initial condition C(t, s, K)\x=t = 

implied local volatility: 


(s — K) + . From (6.4) we get the Dupire scheme for 


I 2 (T,K) = 2 


( 8T + r -k 9k) C 


K 2 


JP_r 

dK 2 L 


(6.5) 


which can be fit to the market’s options data for various T’s and A"’s (see Dupire, 1994J). 

This is called ‘calibration of the implied volatility surface’, although methods for calibrating 
local volatility can be quite complicated with numerical differentiation schemes in T and K, 
regularization constraints, etc (for more on fitting techniques see [Achdou and Pironneau, 2005|). 


61 




























Figure 6.4 shows a calibration of the local volatility surface. 


Implied Volatility Surface 



Time to Matutity T 


Moneyness M = ^ 


Figure 6.4: Implied Volatility Surface 

Shortcomings of Local Volatility. There are some shortcomings of local volatility, one 
of which is the time dependence or ‘little t‘ effects, which refers to changes in the calibrated 
surface with time. In other words, the volatility surface will change from day to day, a 
phenomenon that is inconsistent with A-hedging arguments for local volatility. 


6.3 Fitting the Local Volatility Function 

Define the log moneyness k = log (K/Ste r ^ T ~ t 'i), and the time-weighted implied volatility 

u(T, k ) = d 2 (T, S t e r{ ~ T -^e k ^ (T - t) . 

It is straight forward to verify that the call-option price can be written with a dimensional 
version of Black-Scholes formula, 

C(t,s,K) = e-^-^E q [{S t - K)+\S t = s} 

= sC BSi (lo(T, k),k) 

= s(N(ch) - e k N(d 2 )) , 

where C BSl (cu, k) is the Black-Scholes call-option price St = 1, time to maturity T — t = 1, 
strike e k , interest rate r = 0, initial asset price = 1, and volatility y/u; the formula 
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parameters are d\ = —k/y/u + y/u/2 and ^2 = d\ — y/u. The dimensionless option price 
C BS i (oj, k) takes only log moneyness and time-weight volatility, and then the market’s 
option price is proportional to the dimensionless price. 

Now, there are the following derivatives, 

s, I<) = C BSl (w(T, k),k)^ + C BSl (u)(T, k ), k) 9UA ^ 

= —rC? Sl {u(T, k),k) + C BS ^co(T, k), k) (u t (T, k ) - nj k (T, k)) ( 6 . 6 ) 


lM c{t ' s ’ K) = C * Sl MT ’ k) ’ k) M + C ° Sl (w(T ’ fc) ’ k) d ^~ M 

= ^C BS ' (u;(T, fc), A;) + ■ s ( W (r, fc), k)uj k (T, k) 


(6.7) 


1 d 2 

s <9A' 2 


o2 / f)b \ 2 f) 

C(t,s,K) = w C BS '{u(T,k),k) ( —) + -C BS '{u(T,k),k) 

= ^5 U^C BS (u,(T,k),k)-C BS '(u:(T,k),k) 


d 2 k 

dK 2 


( 6 . 8 ) 


Plugging ( 6 . 6 ), (6.7), and ( 6 . 8 ) into the implied local volatility function of (6.5) yields 

C BS ^u;(T,k),k)u; T (T,k) 


a 2 (t,S t e r ( T = 2-^- 


^C BS i(u(T, k),k) - ^j:C BSl ( uj(T, k), k) 
The denominator in (6.9) involves the following three derivatives: 

C%?'(u,k) = Q - C BS ‘(u,k) 

C* k Sl (cu, k ) - C B ‘ Sl ( W , fc) = (w, A;) . 

Using (6.10), (6.11) and (| 6 . 12), the denominator in (|6.9|) becomes 


(6.9) 


( 6 . 10 ) 

( 6 . 11 ) 

( 6 . 12 ) 


^C BS ^u(T,k),k) - ^C BSi (uj(T, k), k) 

= C BB ' (u>(T, k ), fc)K(T, A;)) 2 + Cf 51 (w(T, A:), A> fcfc (T, k ) + 2C* Sl ( W (T, A:), A:)w fc (T, A:) 
+ q?/ 1 (w(T, A:), A:) - C* 5l (w(T, fc), fc) Wfe (T, A;) - C BS 1 (u;(T, A:), fc) 

= < ( (^(T, A:)) 2 + u; fcfc (T, A;) - ^ Wfe (T, k) + 2 j (cu(T, k), k) , 
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which we then plug into (6.9) to get the following implied local volatility formula in terms 
of the implied volatility surface: 


u 2 ( T,K) 


ut(T, k ) 


u(T,k)^ k ^i k) + 4 


(T,k) u 2 (T,k ) J + 2 U} kk{T, k) 


(6.13) 


6.4 Surface Parameterizations 


Given the formula in (6.13), it remains to decide on a differentiable model for u(T, k). One 


approach is to find a parametric function f e (T,k ) to fit the implied volatility surface. If 
the function is once differentiable in T and twice in k, then the chain rule can be applied 


to obtain the derivatives in the (6.13). 


Let 6 be a vector of parameters fit to the weighted surface, 


min Y I k "> ~ w ( r ’ 


T,k 

where f e is the parametric function. Given a fit 9, we need to check that 

• the fitted surface is a free from calendar-spread arbitrage, 

• and each time slice is free of butterfly arbitrage. 

Proposition 6.4.1. The fitted surface is free from calendar-spread arbitrage if 

$(T, k)>0', 

in other words the slices of the weighted surface cannot intersect. 

To prevent butterfly arbitrage, effectively we need to ensure that the option price yields a 
probability density function, 

Q (S t < K\F t ) = er{T_t) ^2 C(t,T,K) . 

Using the parameterized surface, this density is 

e ' [T ~ t) W c(t - T - K) = * e ’' |7 ' _ ‘ ) ^ cBSl (/'’U,t),t) 


g d (k) 


f 9 (T,k) 


e ~\dl(k;0) 
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where 9) = — kf \Jf e (T, k ) — y 7 f e {T , /c)/2, and 


<A*0 = 1 - 


kf d k (T,k) Y fk(T, k) c “ 

2f(T,k) 


1 i A | y&cr,fc) 
/ 0 (r,fe) + 4y 2 


(6.14) 


Proposition 6.4.2. The fitted surface is free from butterfly arbitrage if and only ifg e (k) > 
0 and lim/ c _ 5 ._|_ 00 d\{k] 9) = — oo, where d\{k\ 9) = —k/\Jf e (T, k) + \Jf e (T, k)/ 2. 

Example 6.4.1 (The SVI Parameterization). One such model is the stochastic volatility 
inspired (SVI) parameterization of the implied-volatility surface. The SVI uses the para¬ 
metric function 

(6.15) 


f e {k) = a + b ^ p(k — m) + y 7 (k — m) 2 + 


with parameters 9 = (a,b, p,m,f) fitted across k and T. Clearly, the SVI has &>f 9 (k) = 0 
to avoid calendar-spread arbitrage (per Proposition 6.4-1) and parameterizations can be 
found to avoid butterfly arbitrage per Proposition 6 . 4 . 
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Chapter 7 

Stochastic Volatility 


Stochastic volatility is another popular way to fit the implied volatility smile. It has the 
advantage of assuming volatility brings another source of randomness, and this randomness 
can be hedge if it is also a Brownian motion. Furthermore, calibration of stochastic volatil¬ 
ity models does have some robustness to the market’s daily changes. However, whereas 
local volatility models are generally calibrated to fit the implied volatility surface (i.e. to 
fit options of all maturities), stochastic volatility models have trouble fitting more than 
one maturity at a time. 

Consider the following stochastic volatility model, 

dS t = jiS t dt + a(X t )S t dWt (7.1) 

dX t = a{X t )dt + P(X t )dB t (7.2) 

where Wt and Bt are Brownian motions with correlation p € (—1,1) such that E[ dWtdBt] = 
pdt, a(x) > 0 is the volatility function, and Xt is the volatility process and is usually mean- 
reverting. An example of a mean-revering process is the square-root process, 

dX t = k(X - X t )dt + 7 V fx t dB t . 

The square-root process is used in the Heston model (along with a(x) = y/x), and usually 
relies on what is known as the Feller condition (y 2 < 2 Xk) so that Xf never touches zero. 


7.1 Hedging and Pricing 

Assuming that a and j3 are ‘well-behaved’ functions, we define the differential operator 
, . d <j 2 (x)s 2 d 2 d (d 2 (x) d 2 , J x d 2 

c= m + -^—a? + + — + + • 

the interpretation of C is the following: — 2 * * <F? + are the generator of S); 

^- 7 ^g^ + a(V t )^ are the generator of X t ; p/3(X t )<j(X t )S t -g is the cross-term; and the 
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remaining are the separate stochastic terms. We use when applying bi-variate Ito Lemma 
for the process in (7.1) and (7.2): 

For any twice differentiable function f(t,s,x), the Ito differential is 


r\ r\ 

df(t, S t ,X t ) = Cf(t, S t , X t )dt + a(X t )S t —f(t, S t , X t )dW t + P{X t )—f(t, S t , X t )dB t . 


(7.3) 


The price C(t,s,x) is the price of claim i/}(s,x) paid at time T given St = s and 
Xt = x. We hedge with a self-financing portfolio Vj consisting of risky-asset, the risk-free 
bank account, and a 2nd derivative C that is settled at time T' > T. This method for 
hedging stochastic volatility is called the Hull-White method. 

The self-financing condition is 

dV t = a t dS t + btdC'it, S t , X t ) + r(V t - a t S t - b t C'(t, S t ,X t ))dt . 


Now, taking a long position in V) and a short position in C(t, St, X t ), we apply the bivariate 
Ito lemma of (7.3) to get the dynamics of this new portfolio: 


d(Vt — C(t, S t ,X t )) 


= a t (iiS t dt + a(Xt)StdWt) 


r\ r\ 

+b t CC'(t, St, X t )dt + b t a(X t )St—C'(t, St,X t )dWt + b t p(X t )—C'{t, S t , X t )dB t 


+r(Vt — a t S t — b t C'(t, S t , X t ))dt 

r\ r\ 

-CC(t,St,X t )dt - a{X t )St—C(t,St,X t )dWt - P(X t )-^C(t, S t ,X t )dB t . 

We now choose at. and bt so that the dWt and the dBt terms vanish, which means that 

r\ r\ 

cT{X t )S t —C(t,St,X t ) = ata(X t )S t + b t a{X t )St^-C'(t, S u X t ) 
os os 

P(X t )^C{t,St,X t ) = b t /5(X t )^-C'(t,S t ,X t ) 


or 
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(7.4) 


b &C(t,St,X t ) 

* £<C'(t,St,x t ) 

r\ r\ 

a t = 7 - C\t, S t , W) - 6t—C"(t, 5 t , X t ) . (7.5) 

as as 

Plugging these solutions into the differential, the a{X t )-^ terms cancel, the ySt terms 
cancel, and by the same arbitrage argument as Black-Scholes, it follows that Vt~C(t, St,Xt ) 
must grow at the risk-free rate: 


d(v t -c t (t,s t ,x t )) 

= atfxStdt + b t LC\t, St, X t )dt + r(V t - a t S t - b t C'(t , St, X t ))dt - £C(t, S t ,X t )dt 
= r(V t — C(t, S t , X t ))dt . 


Inserting at and bt from equations (7.4) and (7.5), and after some rearranging of terms 
(and dropping the df s), we get 


r)S t ~ - r^j C(t, S t , X t ) = b t [c~{ l i- r)S t -^ - C'(t, S t , X t ) 


and dividing both sides by J^C(t,S t ,X t ), we get 


(£ - (M - r)S t & - r) C(t, S t , X t ) _{£-(»- r)S t & - r) C'(t, S t , X t ) 


£C{t,St,X t ) 


£C’(t,St,X t ) 


= R(t, s, x) 


The left-hand side of this equation does not depend on specifics of C’ (e.g. maturity and 
strikes), and the right-hand side does not depend on specifics of C. Therefore, there is a 
function R(t, s, x ) that does not depend on parameters such as strike price and maturity, 
and it equates the two sides of the expression. We let R take the form 

R(t, s, x) = —a(x) + /3(x)A(t, s, x) 


where 

A(i, s, x) = 4 + g(t, s, x)y/l - p 2 , 

(T{x) 

where g is an arbitrary function. Hence, we arrive at the PDE for the price C[t, s, x) of a 
European Derivative with stochastic vol: 


Proposition 7.1.1. (Pricing PDE for Stochastic Volatility). Given the stochastic 
volatility model of equations (7.1) and (7.2), the price of a European claim with payoff 


68 











if (St) satisfies 


8 a 2 (x)s 2 d 2 d 2 

m + ^-d^ + pl3ixMx) ’a^- s 


d (3 2 (x ) d 2 d . 

+ a(x) — + 7—^—— ^ + rs— - r | C 


2 dx 2 


ds 


d 


= j3(x)A(t,s,x)—C 


(7.6) 


with C 


= *P(s,x). 


The PDE in (7.6) can be solved with Feynman-Kac, 

C(t,s,x) = e- r ( T -VEQ{fi(S T ,X T )\S t = s,X t = x} . 
which yields a description of an EMM, Q, under which the stochastic volatility model is 

^ = rdt + a(X t )dW t Q 

Jt 

dX t = (a(X t )~ /3(X t )A{t,S t ,X t ))dt + p(X t )dB? 

where W® and B® are Q-Brownian motions with the same correlation p as before. The 
interpretation of A(t, s, x) is that it is the market price of volatility risk. 

One can think of a market as complete when there are as many non-redundant assets 
as there are sources of randomness. In this sense we have ‘completed’ this market with 
stochastic volatility by including the derivative C' among the traded assets. This has al¬ 
lowed us to replicate the European claim and obtain a unique no-arbitrage price that must 


be the solution to (7.6). 


Example 7.1.1. (The Heston Model). For the Heston model described earlier, the 
EMM Q is described by 

dS t = rS t dt + y/XtdW? (7.7) 

dX t = k(X - X t )dt + \X t dt + -i^fXtdB® (7.8) 

where \X t is the assumed form of the volatility price of risk. This model can be re-written, 

dX t = k(X- X t )dt + 7 \/XtdB? 

where R = k — A and X = ■ It is important to have the Feller condition 

7 2 < 2 RX 

otherwise the PDE requires boundaries for the event X t = 0. The process X t is degenerate, 
but it is well-known that the Feller condition is the critical assumption for the application 
of Feynman-Kac. The fit of the Heston model to the implied volatility of market data is 


shown in Figure 7.1 
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maturity: 08/17 maturity: 09/21 




log(F/K) log(F/K) 



log(F/K) 


maturity: 11/16 



Figure 7.1: The fit of the Heston model to implied volatility of market data on July 27 of 
2012, with different maturities. Notice that the Heston model doesn’t fit very well to the 
options with shortest time to maturity. 
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7.2 Monte Carlo Methods 


A slow but often reliable method for pricing derivatives is to approximate the risk-neutral 
expectation with independent samples. Essentially, sample paths of Brownian motion 
are obtained using a pseudo-random number generated, the realized derivative payoff is 
computed for each sample, and then the average of these sample payoffs is an approximation 
of the true average. For example, let i = 1, 2,..., N, let Sip be the index for an independent 
sample from St's probability distribution. If E®< oo, then by the law of large numbers 
we have 

i N 

— - K ) + E q {S t - K) + . 

v e=i 

For stochastic differential equations, the typical method is to generate samples that 
are approximately from the same distribution as the price. The basic idea is to use an 

Euler-Maruyama scheme, 

log = log ~S? + {h ~ ^ 2 ) A t + a(W t % - Wf) , (7 .9) 

for some small At > 0, with 

W t + At ~ W t W ~ iidAf(0, At) . 

This scheme is basically the discrete backward Riemann sum from which the Ito integral 
was derived (see Chapter [2]). In the limit, the Monte-Carlo average will be with little-o of 
the true expectation, 


lim 

TV— >-OC 


i - K)+ = E Q(St - K) + + o(At) 


l= l 


The scheme in (7.9) can easily be adapted for a local volatility model, and can also be 


adapted to stochastic volatility models, but the latter will also need a scheme for generating 
the volatility process Xt. 


Example 7.2.1. (The exponential OU model). Consider the stochastic volatility model 

^ = rdt + e Xt dW? 

St 

dX t = k(X- X t )dt + 7 dBf 


with correlation p between W® and B®. The Euler-Maruyama scheme is 

log S t+At = log S t +(r- ^e 2Xt ^j At + e Xt (p(B t+At - B t ) + y/l - p 2 (W t+At - Hy)) 
Xt+At = X t + k — X t ^j At + 7 (Rj+Ai ~ Bt) 
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where B^ At — B*jp and wj+ At — are increments of independent Brownian motions. 

Sometimes, there more clever methods of sampling need to be employed. For instance, 
the Heston model: 


Example 7.2.2. (Sampling the Heston Model). Recall the Heston model of (7.7) 
and (7.8). A scheme similar to the one used in the previous example will not work for the 
Heston model because Euler-Maruyama, 


X t+ At = x t + k{X - X t )At + ( B t+At - Bt ) 

will allow the volatility process to become negative. An alternative is to consider the 
Stratonovich-type differential Q 

dX t = (k(X - X t ) - y) ^ + 7 \fXt o dB t 

and then use an implicit Euler-Maruyama scheme 

-X'=(k{X- X t+At ) - Bj At + jfEX, (-Bt+At - Bt) . 


which can be solved for y X t+At using the quadratic equation. Notice that a condition for 
the discriminant to be real is the Feller condition that 7 2 < 2kX . 

In general, Monte Carlo methods are good because they can be used to price pretty 
much any type of derivative instrument, so long as the EMM is given. For instance, it is 
very easy to price an Asian option with Monte Carlo. Another advantages of Monte Carlo 
is that there is no need for an explicit solution to any kind of PDE; the methodology for 
pricing is to simply simulate and average. However, as mentioned above, this can be very 
slow, and so there is a need (particularly among practitioners) for faster ways to compute 
prices. 

Another important topic to remark on is importance sampling, which refers to 
simulations generated under an equivalent measure for which certain rare events are more 
common. For instance, suppose one wants to use Monte Carlo to price a call option with 
extreme strike. It will be a waste of time to generate samples from the EMM’s model if 
the overwhelming majority of them will finish out of the money. Instead, one can sample 
from another model that has more volatility, so that more samples finish in the money. 
Then, assign importance weights based on ratio of densities and take the average. See 
[Glasse rman, 2004 for further reading on Monte Carlo methods in finance. 

1 This refers to the Stratonovich-type integral that is the limit of forward Riemann sums, f f s o 
dB a = limjv X)n=i — For the square-root process, the Ito lemma on \fXf is used to 

show that 7 f \/X~ s dB a = 7 lim n y /X trl _ 1 AB tn _ 1 = — 7 lim„ (y/X tn - ^/X tn _ 1 ^ (B tn - B tn _ 1 ) + 
7 lim n VXff( B t„ ~ B tn _f) = — / ds + 7 / \fXT a o dB s . 


72 











7.3 Fourier Transform Methods 


The most acclaimed way to compute a European option price is with the so-called Fourier 
transform methods. In particular, the class of affine models in Duffle et al., 2000 have 


tractable pricing formulae because they have explicit expressions for their Fourier trans¬ 
forms. The essential ingredient to these methods is the characteristic function, 


4 >t{u) = E^ exp(uilog(Sr)) Vu E C, i = 


where E^ now denotes expectation under an EMM. The function 4 >t{u) is the Fourier 
transform of the log St 's distribution function, 

/ OO 

e lus f{s)ds 

-OO 


where f(s) is the density function for log St- The inverse Fourier transform is f(s) = 
Tf f-oo e- lus cf T (u)du, which leads to the cumulative distribution function 


F(s)-F(0) = f° f(v)dv=^ 


1 — e 


iu 


-<j>T{u)du 


The crucial piece of information is the characteristic function, as any European claim 
can be priced provided there is also a Fourier transform for the payoff function. Let pr 
denote the density function on the risk-neutral distribution of log(Sr) and let if denote 
the Fourier transform of the payoff function. It is shown in [Lewis, 20001 that 


/ oo coo i riz-\- oo ^ 

if(s)f(s)ds = — e~ ms if(u)duf(s)ds 

-oo J —oo ^ Jiz —oo 


i 

27T 


riz-\-oo ^ pc 

/ / 

J iz —oo J— c 


S pr{s)dsdu = — 

Z7T 


L 


iz-\- oo 


ip(u)(j)T(—u)du 


where the constant z E 


in the limit of integration is the appropriate strip of regularity 

This usage of characteristic functions is 


for non-smooth payoff functions (see Table 7.1) 
far reaching, as the class of Levy jump models are defined in terms of their characteristic 
function, or equivalently with the Levy-Khintchine representation (for more on financial 
models with jumps see [Cont. and Tankov, 2003|). 


7.4 Call-Option Pricing 

An important tool is the formula of Gil-Pelaez: 
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Table 7.1: Fourier Transforms of Various Payoffs, if(u) 
h fiz-™ e~ ius ^(u)du, z = 3(u). 


i°°oo e lus if(s)ds and 'ijj(s) = 


asset 

V’(s) 

V’N 

regularity strip 

call 

(e s - K) + 

fciu+1 

u 2 —iu 

z 

> 

1 

put 

(. K - e s )+ 

fciu+l 

u 2 —iu 

z 

< 

0 

covered call; cash se¬ 

nrin(e s , K ) 

fciu+1 

u 2 —iu 

0 < 

z 

< 1 

cured put 






cash or nothing call 

\e 3 >K] 

K iu 

iu 


> 

0 

cash or nothing put 


K iu 

iu 

z 

< 

0 

asset or nothing call 

eS l[e s >K] 

fciu+l 

iu-\- 1 

z 

> 

1 

asset or nothing put 

eS l[ e s<x] 

1-1 

iu-\- 1 

z 

< 

0 

Arrow-Debreu 

5{s - log(iF)) 

K iu 

z 

G 

R 


Proposition 7.4.1 (Gil-Pelaez Inversion Theorem). The distribution function F(s ) is 
given by 




r °° e ms (j)T(—u) — e 


s (j) T {u) 


iu 


du , 


(7.10) 


for all s G (— oo, oo). 


Proof. First let’s prove an identity, 

if v < s 
if v = s 
if v > s 


r-R e iz \ 
dz+ —dz ) . 

Je Z ) 

A closed path can be constructed from — R to R, making a semi-circle of radius e around 
zero, and then from R back to —R on a semi-circle of radius R. Along this path the integral 
of eh z jz is zero, 


2 f c 

sign(u — s) = — / 

11 Jo 


sin(tt(t> — s)) 


du = 


u 


-1 

0 

1 


This integral is shown using residue calculus: 


sin (u) , 1 

- du = - 

u 2 


^-^du = - lim lim Im 

U 2 e —>-0 R —>-0 


/ 


-R 2 





e 

—dz = 0 

z 
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where C e = {ee 1 ^ ^|0 < 6 < it} and Cr = {Re* e |0 < 9 < 7 r}. Now, 


f e lz , 

1 

/ —dz 


Jc R z 

~ R 


L 


7r giRcos( 0 )—Rsin(O) 


oiO 


d6 


oiO 


0 & 

1 r 7r p iRcos{6)—Rsin(Q) 

<-d 

-U- 

7 r 

< — —>• 0 as i? —)► oo 

IX 


dO 


—R sin 


M °) d 0 


and 


Hence, 


f e“ f 

L T* = L 


0 1 + O(e) j, 


iee l9 dd = —in + 0(e) . 


ee 


16 


/o u 


_, , d* + / d*) = 

2 e^O i?-s>0 \J—R Z Je z J 2 


Sin( ^du = 1 lim lim Im 


Now notice that 


1 f°° e lus (p T (-u) - e~ ius (j)T(u)_ du _ 1 j ,0 ° ^°° _ e iu( v ~s) ^ ^ 


7T 


/ 


7U 


rr 

77 Jo 7 —oo 
2 /•oo /-oo 


IU 

sin(u(u — s)) 


7T Jo 

roo 


u 


dF{y)du 


In/ ,2 Z -00 sin(tt(i; — s)) 

dF(v)— / ——--du 

vr Jo u 

roo 


= / dF(u) - / dF(u) 

J —oo J s 

= 2F( S ) - 1 , 


which proves (7.10). 


□ 


Notice that (7.10) can be further simplified to have 

e~ ms J>r[u ) 


1 1 Z 100 

F(s) = - - - / Re 
^ ^ Jo 


m 


du , 


which is used in pricing a European call option, as the risk-neutral probability of finishing 
in the money is 


1 1 r°° 

n 2 = Q(St >K) = - + - Re 
^ 71 Jo 


g-tulogW^,^) 

iu 


du 
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where Q denotes probability under the EMM. The ‘delta’ of the option is 


p-rT -tT ro o 

Hi = Q ^ Q S T t [ST > K] = Q / e s f(s)ds , 

JQ *0 JlogK 

but we can define a change of measure, 

dQ _ e ST _ e ST 
dQ “ E Q e *r ~ S 0 e rT ’ 


where st = log St . Then, 


¥ qaus t = E Q e (1+iu)8r = Mu-i) 
jgQg^T i) ' 


and therefore we have the formula 


11 r°° 

IIi = Q(S t >K) = - + - Re 
* n Jo 


e -mlog {K)^ u _ Q 

iuc/>T(—i ) 


du 


Assuming no dividends, the value of a European call option with strike K, at time t = 0, 
is 


C = 5 0 ni - Ke~ rT n 2 . 


(7.11) 


The pricing formula of equation (7.11) is (in one way or another) interpreted as an inverse 
Fourier transform. Indeed, the method of Carr and Madan fCarr et al., 1999 shows that 


(7.11) can be well-approximated as a fast Fourier transform, which has the advantage of 


taking less time to compute than numerical integration, but it is also less accurate. Equa¬ 


tion (7.11) has become an important piece of machinery in quantitative finance because it 


is (or is considered to be) an explicit solution to the pricing equation. In general the pricing 


formula of (7.11) applies to just about any situation where the characteristic function is 
given. 


7.5 Heston Explicit Formula 


Finally, an important application of the Fourier pricing formula in (7.11) is to the Heston 
model. The call option formula for the Heston model from equations (7.7) and (7.8) has a 
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well-known inverse Fourier transform (see [Gatheral, 2006]) with 


yrheston 

3 

4> J A U ) 


Aj{u) 


Bjiu) 


9j 

dj 

Cl 


+ 


J o 


Re 


0 —iu lo, 


n (u) 


du for j = 1, 2, 


ixp ( Aj(u ) + Bj(u)X 0 + m (log (So) + rT)) 

^ ({bj ~ pjui - dj)T - 2 log j^—) ) 

— nrvuA. — H : / 1 — p~dj r \ 


/07m — dj f 1 — e _dj ' 

7 2 \1 — gje~ d i r ) 

— fr/ui — dj 

— fr/ui + dj 

(/97m — bj ) 2 — j 2 { 2 cjui — it 2 ) 


which is applied for the Heston model the volatility price of risk absorbed into parame¬ 
ters k and X. Warning: Use a well-made Gaussian quadrature function for numerical 
computation of the integrals for the Heston price; do not attempt to integrate with an 
evenly-spaced grid. 


7.6 The Merton Jump Diffusion 

Short time-to-maturity options are known to be well fit by the class of models where 
log-asset prices are Levy processes. For instance, the Merton jump diffusion, 

7 O 

—d = (r - v) dt + adW t + (e Jt - l) dN t 

where a > 0, N t is an independent Poisson process with parameter A G (0, oo), Jt is a 
sequence of i.i.d. normal random variables with rs_/ N(hj, aj) when N u - N u _ = 1, and 
/ \ 

v = A I e^ J+ 2 — 11 is a compensator to ensure the process is a discounted martingale. 
The log-price is 


log(VSo) 




du + aWt + 



JydN u 


N t 

du + <7 Wt + 'y ^ Jn ■ 

n =1 
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For Sq = 1, the characteristic function is derived as follows, 


= IE exp ( iu ( r — v —— J T + iuaWr + iu j JtdN t 


= exp \iu\r — v — 


a 


a 2 u 2 

T - T + XT 


jlUfAJ 



This is an example of a characteristic function that is known explicitly. In addition, it is 
particular example of the general Levy-Khintchine representation, 


Eexp (iulog(ST)) 

= exp {^iu — is — y ^ T - + T ( e lux - 1 - iux l| x |<i) rj(dx)^j , 

where rj is the intensity measure, satisfying / K \j 0 j 1 A x 2 r](dx) < oo. For more on the 
Levy-Khintchine representation, see [Cont. and Tankov, 2003]. 
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Chapter 8 


Stochastic Control 


This chapter takes techniques from stochastic control and applies them to portfolio manage¬ 


ment. The portfolio can be of varying type, two possibilites are a portfolio for investment 
of (personal) wealth, or a hedging portfolio with a short position in a derivative contract. 
The basic problem involves an investor with a self-financing wealth process and a concave 
utility function to quantify their risk aversion, from which their goal is to maximize their 
expected utility of terminal wealth and/or consumption. To exemplify the need for hedging 
obtained from optimal control, recall the price of volatility risk A (t, s, x) from Proposition 
7.1.1| of Chapter]?} The pricing PDE for stochastic volatility depends on A, but incomplete¬ 
ness of the market means that A may not be uniquely specified. However, an expression 
can be obtained from the solution to an optimal control, hence writing A as a function of 
the investor’s risk aversion. This chapter will start by considering the basic problem of 
optimization of (personal) wealth, and later on will show how optimal control is used in 
hedging derivatives. 

8.1 The Optimal Investment Problem 

Consider a standard geometric Brownian motion for the price of a risky asset, 


7 

-p w = M dt + adW t 


( 8 . 1 ) 


S t 


where /j, £ M, a > 0, and VP is a Brownian motion under the statistical measure. There 
is also the risk-free bank account that pays interest at a rate r > 0. At time t > 0 the 
investors has a portfolio value Xt with an allocation nt in the risky asset and a consumption 
stream ct- The dynamics of the portfolio are self-financing, 



( 8 . 2 ) 
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where 


7 Tt = proportion of wealth in the risky asset, 
ct = rate of consumption. 


A natural constraint on consumption is c* > 0 for all t > 0, and based on (8.2) a constraint 
of > 0 is enforced automatically. Here we have taken Xt > 0 almost surely, but in 
general any hnite lower bound on Xt is necessary to ensure no-arbitrage (i.e. Xt > —M > 
—oo almost with constant M hnite), otherwise there could be doubling strateguies. The 
optimization problem is then formulated as 


V(t, x) = max E 

7T,C>0 


J F(u, c u , X u )du + U(X T ) 


X t = x 


(8.3) 


where F is a concave utility on consumption and wealth, U is a concave utility on termi¬ 
nal wealth, and the admissible pairs (nt,ct)t>o are non-anticipating, adapted to W, with 

Jq \irtX t \ 2 dt < oo almost surely. We refer to E /) T F(u, c u , X u )du + U(Xt) X t = x 
the objective function, and refer to V as the optimal value function. 


as 


Example 8.1.1 (Logarithmic utility). Suppose that F = 0 and U(x) = log(x). There is 
no utility of consumption, so the optimal is ct = 0 for all t > 0. Now apply ltd’s lemma, 


dlog(Xt) = rdt + 7r t 
and then taking expectations, 

E[log(A T )|A t = x\ = log(x)+E 




dSt \ c7“7rr 

— - rdt) —<tt , 


T / ( \ 0 ’ 27r « \ 

r + tt u [p - r) - — | du 


X t = x 


where the right-hand side is concave in nt- Hence, the optimal strategy is 

p — r 


n = 


a^ 


Vi e [0, T] 


which is the Sharpe ratio divided by the volatility. The optimal value function is 


V(t,x ) = max E [log(A 7 -) | A t = x] = log 
and using U~ 1 (v) = e v , we find the certainty equivalent, 


r+ 


U^-rY 


C T-t ) 


2cr 2 


X ce = z-rlT-t^- 1 (y^ Xt )) = X t e 
which is the risk-free rate plus | times the Sharpe-ratio squared. 


(T-t) 
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Example 8.1.2 (Log-Utility of Consumption). Suppose that F(t,ct,X t ) = e ^log (cfX t ), 
U(x ) = 0, and T = oo. The optimization problem is 


max E 

7T,C>0 


e t] log (c u X u )du 


Ut 


X t = x 


V(x) , 


which is constant in t. Now notice for any admissible ( 7 r, c) on [t, t + At] we have the 
dynamic programming principle, 

/ t+At 

e-diu-i) i og ( Cu X u )du , 

with equality if and only if ( 7 r, c) is chosen optimally over [t,t + At], and hence 


E t V(X t+At ) - V(X t ) 

At 

1 p —/ 3 A t 1 rt+At 

< -^- E t V(X t+At ) - —E t j e -P( u ~ t ) log (c u X u )du 

->• PV(Xt) - log (ctX t ) , 


as At —> 0. On the other hand, from ltd’s lemma we have 
dV(X t ) = 


A 2 v2 q 2 q 

-^T^dx2 V{Xt) + (r + ~ r) “ Ct)Xt dx V{Xt) ) dt 

+ air t X t -^-V(Xt)dW t , 

and assuming the Brownian term vanishes under expectations, we have 
E t V(X t+At ) - V(X t ) 


= -J-E t 

At 


At 

•t+At / 2 2 j ^2 q 2 q 

-^-^—V(X u ) + (r + Tr u (p-r)-c u )X u —V(X u ))du 


a 2 2 v2 s2 a 

V(X t ) + (r + n t (p - r) - a) X t -^V{X t ) , 
as At — > 0. Hence, for all admissible pairs ( 7 r, c) the value function V(x) satisifies 

g.2^.2g,2 q2 q 

~2~~daF V ( x ) + + ~ ^ ~ X dx V ^ ~ + log(cs) < 0 , 

with equality if and only it and c are optimal, which leads to the equation 


max 

7T,C>0 


^ 2 ^. 2^2 q2 q 

° ^ X 7^2 V(x) + (r + 7 r(p - r) - c) x—V{x) - j3V(x) + log(cx) ) = 0 . 
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Let’s assume the ansatz 


V(x) = a log(x) + b . 

Then through first-order optimality conditions (i.e. by differentiating with respect to n and 
setting equal to zero) we find the optimal 


ir(x) = — - 


^ £lV(x) CT* 

Similarly, first-order optimality conditions for c yield 

1 1 


clx = 


x £ v (x) a 


Putting optimal Tit and ct back into the equation for V along with the ansatz, we find 

log (x/a) - (3(alog(x) + b) + (ar - 1) + " ^ / - = 0 , 

2 a 1 

and comparing log(x) terms and non-x-dependent terms we find, 

1 

a = -r 


b = 


1 {p — r) z r 1 


2/3 2 a 2 P 2 ' p 


+ -52 + o (log (P) ~ 1 ) 


8.2 The Hamilton-Jacobi-Bellman (HJB) Equation 


Example 8.1.1 is useful to get started and to get a sense for how an optimal control should 
look. Example 8.1.2 is more instructive because it shows us how (i) the function V inherits 
concavity from F and U, and (ii) how it also shows how to derive the PDE that V should 
satisfy. 

The derivation starts with the dynamic programming principle, 


V(t,x) = maxE 

ir,c> o [ Jt 


rt+At 


F(u, c u , X u )du + V(t + At, X t+At ) 


X, = x 


where max,,-^ is taken over the interval [t,t + At]. Applying Ito’s lemma to V(t,Xf), we 
find 


V(t + At, Xt+ A t) 
= V{t,X t ) + 


t+At ( q cr 2 7r 2^'2 q2 q 

^2 + ( r + -r)~ C U )X U — ) V(u,X u )du 


Wt + 


2 

rt+At. 


d 


T o’y tt u X u V(u, X u )dW u , 
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for any admissible (n, c) over [t, t + At]. Hence, for any ( 7 r, c) on [t, t + At] we have 


E 


ri+At / q 2 2 X 2 q2 

[f(u,c u ,X u )+[-+ u u 


2 dx 2 
d 


+ (r + ir u (n - r) - c u ) X u — 1 V(u,X u ) I du 


X, = x 


<0 , 


with equality iff and only if an optimal ( 7 r, c) is chosen. Hence, dividing by At and taking 
the hint to zero, we obtain the so-called Hamilton-Jacobi-Bellman (HJB) equation: 


( ( d a 2 nr 2 x 2 d 2 

mg l F (t,c)+l m + —- — 2+ (r + *(»- r) 


-o)xXjv(t,x)J ^0 , (8.4) 

V(T, x) = U(x) . 


8.3 Merton’s Optimal Investment Problem 

Let F = 0 and consider a power utility function, 


U{x) = 


x 


1-7 


1 -7 ’ 

where 7 > 0, 7 7 ^ 1 is the risk aversion. The problem is to solve 

V(t,x) = max E [U(X T ) | X t = x] , 

7T 

The HJB equation for this problem is 

3 3 \ {3 ^ 3 

¥t +rX foc) V ( t ’ x ) + ™ x {^—fa2 V ( t ’ x ) + 7r (d-r)x—V(t,x) ) = 0 , (8.5) 


V{T,x) = U(x) , 


for which we find the optimal 7 r, 


TTt = - ■ 


d~r &V(t,x) 


x * 2 &V(t,x) ‘ 


Inserting the optimal 77 into (8.5) we obtain the nonlinear equation, 

9 , 9 \^n ^ 

— + rx— )V(t, x) - --- — = 0 . 

dt dx) 2a 2 £rV(t,x ) 


( 8 . 6 ) 
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Then using the ansatz V(t,x) = g(t)U(x), we find 


^V(t,x) = g'{t)U{x) , 
-K~V(t,x) = - —- g{t)U(x) , 

OX X 

Q <7(1 — 7) 
-^ v {t,x) = - 1 -^^ g {t)U{x) 


and inserting in ( 8 . 6 ) we find an ODE for g, 

^) + r(l-,) + fiz^!»W_o 

with terminal condition g(T) = 1. The solution is 

(1-7)(T-t) (r+ (ll ~ r f } 
g(t) = e ; v , 

and the optimal value function is 


xe 


(^)(r+^ 


1-7 


V(t,x) = U (x)g(t) = 


and the certainty equivalent is 


1-7 


= U \ xe 


(T-t)[r+^ 


X? = e -r(T-t)jj~l^ Xt ^ = X J T \ 27 


(.P-ry 


8.4 Stochastic Returns 

Consider the model 


dS t = Y t S t dt + aS t dWt (8.7) 

dY t = n{p - Y t )dt + {3dB t , ( 8 . 8 ) 


with dWtdB t = pdt where p G (—1,1). The interpretation of Y t could be any of the 
following: Yj is a dividend yield with uncertainty (although somewhat of strange model 
because it can be negative), or Yt is the return rate on a commodities or bond portfolio 
where there is a role yield due to contango or backwardation. 

Let’s assume the simple case p = r = 0, for which the value function is 


V(t,x,y) 


maxE 


U{X T ) 


X t = x,Y t = y 
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and has HJB equation 


d p 2 d 2 d\ T „ 

m + -2By^- Ky By) V(t ’ X ’ y) 

f (T^X^TT^ 1 9 ^ 0 

+ rnI —^— fa? V ( t,x ’ y ') + Kxy—V(t,x,y) 


d 2 \ 

+pTTxPa-^-^-V{t, x, y) j = 0 , 


V(T,x,y) = U(x ) . 


(8.9) 


The first-order condition for 7 r yields the optimal 


xy-^V(t,x,y) + pirxPa-^Vii^^) 


VTt = - 


cr 2 x 2 £ 7 V(t,x,y) 


For the power utility 


U(x) = 


4-7 


1 - 7 ’ 

we have the ansatz V(t,x,y ) = U(x)g(t,y) with 


dx 2 

d 2 

dxdy 


V = - 


(1 - 7)7 


g(t,y)U(x) 


1 — 'y d 

V = - —g(t,y)U{x) , 

x ay 


all of which are inserted into equation (8.9) to get an equation for g: 

d P 2 d 2 


7T7 + 


d 


1-7/ pP °^ 9 ( t , y ) 


dt 1 2 dy 2 + 2 cj 2 7 l y+ g(t,y) 


g(t,y) = 0 
g(T,y) = 1 


We now apply another ansatz g(t,y) = e a ^ y2+b ^\ which when inserted into the equation 
for g(t, y) yields the following system: 

■1 ~ 


P : a!(t) = - 2 ^ (l + a 2 (f) - 2 - «) a(t) - 


1 ~7 
2ct 2 7 ’ 


1 : b'(t) = ~p 2 a(t) , 
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with terminal conditions a(T) = b(T ) = 0. The solution a(t) can be written as a ratio, 


a(t) 


_ At) _ 

2j3 2 + (bM) „(() ' 


where v(t) is the solution to a 2nd-order ODE, 


(77 


The roots of this equation are 


a 2 7 


7 


m± = _ (7/so^ 7 K / a 

\ O / V (T 7 V °7 

which gives the general solution 

v(t) = C'ie m+ ^ T- ^ + C , 2 e m - (T_t) . 

It is not necessary to fully determine constants C\ and C 2 because we are mainly interested 
in the ratio v'(t)/v{t). 

Finite-Time Blowup. Complex valued rn± leads to finite-time blowup for the op¬ 
timization problem. If the roots are complex then let c = — and d = 

^2 Kp + — k 2 so that the general solution is 

v(t) = [(J\ cos (d(T — t )) + C 2 sin (d(T — t ))^ , 


and with v'(T) = —cC\ — dC 2 = 0 to satisfy the terminal condition a(T) = 0, so that 

v(t) = Cie c ( T ~^ ^cos (d(T — t)) — - sin (d(T — t))^ . 

The solution a(t) will blow at time t* such that cos (d(T — t*)) — ^ sin(d(T — t*)) = 0, that 
is tan(d(T — t*)) = | or 


T — t* 


1 

d 


^7rl[ c <0] + tan 



86 



8.5 Stochastic Volatility 


Now let’s consider the same optimal terminal wealth problem as the Merton problem, with 
exponential utility 

U(x) = — —e~ lx where 7 > 0 , 

7 

and in the incomplete market of stochastic volatility, 

dS t = fiS t dt + a(Y t )S t dW t 
dY t = a(Y t )dt + P{Y t )dB t , 


( 8 . 10 ) 

( 8 . 11 ) 


where dWt ■ dB t = pdt. From (8.10) and (8.11), we have the wealth process, 

' dS t 


dX t = rXfdt + 77 


— rdt 


no longer enforcing the non-negativity constraint. The optimization problem is 


V(t,x,y) = maxE U(Xt) 


X t = x, Y t = y 


but the technique used in Example 8.1.1 does not apply because there is some local martin¬ 
gale behavior in the stochastic integrals. Instead, we arrive at the optimal solution using 
the HJB equation. The HJB equation is 

d f3 2 (y) d 2 q \ 

12 +a yy>xr .) v \ t ^ x ^y) 


d 


dt +TX dx + 2 dy 2 


dy) 


( a 2 (y)7r 2 d 2 d 

+ max ( - 2 - ]h? V ( t,X,y ^ + 7r (/ J '- r )fa.V( t , x ,y) 


+pir/3(y)cr(y) 


Using the ansatz V(t,x,y) = U(xe r ^ T t ' ) )g(t,y), we have 


d 2 


dxdy 


V(t,x,y)j= 0, (8.12) 

V(T,x) = U(x) . 


Tj-V = 7rxe r(T + U(xe r<yT t) )^g(t,y) , 

|y = 7 w-<v, 

J? V = - 7 e’-(r-t) t/ ( xe r(T- t)) |. (t ) 5 
axay oy 
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which we insert into (8.12) to find a PDE for g, 


d p 2 (y) d 2 . x d . . . 




+ mm 


72 e 2r 0r t ) C r 2 (y) 7 r 2 


9(t,y)~ 7e r(T t) n(^ti-r)g(t,y)+pP(y)a(y)-^g(t,y)j j = 0 


(8.13) 
g{T, y) = 1 


and the optimal strategy is 

_ _ r(T -t) / /i~T- 


n t = e 


P{y) %j9(t,y) ' 

i° 2 {y) ' r 7o-(y) 9(*,y) 




Inserting this optimal 7p into (8.13) yields the nonlinear equation 

a 


d P 2 {y) d 2 


¥t + 


0 

+ a (y)TT- ) g(t,y) - 


2 dy 2 ' dy J 2 


o 2 {y) (y~ r , P(y) &a9{t,y) 


v 2 (y) +P °(y) g(t,y) 


g(t,y ) = o . 

(8.14) 


This equation can be reduced to a linear PDE if we look for a function ip(t,y) such that 

g(t,y) = t,y) q , 

where q is a parameter. Differentiating yields, 

d qgd 
9 = ^dt v 


dt 

_o•_ = og_^_, 
dy^ if dy 


1 d 2 


% 


;lf + 


q — 1 f d 


dy^ 


2 g ( m i , t p Qy2 t 1 ^2 

and then plugging into (8.14) with chosen parameter q = l/(l + p 2 ) yields a linear equation: 

(n~r)P(y)\ d \ (y~r ) 2 


d (3 2 {y) d 2 
2 dy 2 


Of 2 V + (“(»)-*> r(y) = (8 ' 15) 

Example 8.5.1 (Fully Affine Heston Model). Consider a futures contract Ft t with set¬ 
tlement date T and stochastic volatility and returns, 


dFi 


— = yYtdt + ^jYtdWt 


Ft,T 

dY t = k(Y - Y t )dt + fd^YtdBt 



















where /3 2 < 2kY and dWtdB t = pdt. The wealth process for futures trading is 

dF t .r 


dX t = rX t dt + 7 Ti¬ 


ft 


t,T 


For U(x) = — y 1X the optimal terminal expected utility is 

V(t, x, y) = U(xe r ^ T ~ t) )'tp(t,y) q , 


where is similar to a solution to equation (8.15), except the equation has no r, 

'2 £& a \ 2 

+ (k(Y -y)~ ppfdy) —J if ft, y) - y) = 0 . 


d /3 2 y d 2 


k dt 2 dy 2 

It can he further shown that the solution to this equation is of the form 

il>(t,y) = e a ® y+b ® , 

with a(T) = b(T ) = 0, and where aft ) and b(t) satisfy ODEs, 

a'{t ) + y a 2 (t) - (k + puP) aft) - = 0 


b'ft) + kY aft) = 0 , 


both of which can be solved explicitly. 


8.6 Indifference Pricing 

Stochastic control for terminal wealth can be implemented to find the the price of a call 
option under stochastic volatility, 


V h (t, x, y, s) = maxE U(X T - (St - K) + ) X t = x,Y t = y,S t = s 


where the investor now hedges a short position in a call option with strike K. Compared 
to the same investor’s value function that is not short the call 


V°(t, x, y) = maxE U(Xt) 


X t = x,Y t = y 


we look for the amount of cash $p such that the 

V h (t, x + p,y,s) = V°(t, x , y) . 
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The extra cash makes the hedger utility indifferent to the short position. With exponential 
utility there is a separation of variables, 


V ( t , x, y, s ) = maxE 


_I p -7 {X t -{S t -K)+) 


L 7 


X t = x,Y t = y,S t = s 


= -e 

7 


—'yxe r ( T 7 


min E 


er{T n) ^u{^-rdu^-(S T -K)+^ 


Y t =y,S t = S 


U (xe r(T g h {t,y,s ) , 


where we’ve used the differential d (e r ^ T ^Xf) = e r ^ T — rdt^j. Hence we find a 

price $p such that U (pe r ^ T ~^) = g(t, y)/g h (t, y, s ), where g(t, y) is the solution from ( |8.13[ ). 

Depending on the risk-aversion coefficient 7 , there will be different prices $p. This 
brings us back to the price of volatility risk A (t, s,x) from Proposition 7.1.1 of Chapter [Tj 
Namely, investors with different risk aversion will have a different A for their martingale 
evaluation of the call option. 

If an indifference price is obtained then there is a solution to both optimization prob¬ 
lems, and hence there is no-arbitrage and the range of prices for $c will be a no-arbitrage 
interval. For complete markets there will be a single price $c for all levels of risk aversion. 
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Appendix A 


Martingales 


and Stopping Times 


Let J-f denote a cr-algebra. A process Xt is an T^-martingale if and only if 

Wt = X t Vt<T , 

where E* = E[ • A process Xt is a submartingale if and only if 

E t X T >X t Vt < T , 

and a supermartingale if and only if 

E t X T <X t Vt <T . 

Note that true martingale is both a sub and supermartingale. 

A.l Stopping Times 

In probability theory, a stopping time is a a stochastic time that is non-anticipative of the 
underlying process. For instance, for a stock price St a stopping time is the first time the 
price reaches a level M, 

t = inf{t > 0|iSt > M } . 

The non-anticipativeness of the stopping is important because there are some events that 
are seemingly similar but are not stopping times, for instance 

v = sup{f > 0|Si < M} 


is not a stopping time. 

Stopping times are useful when discussing martingale. For instance, there is the optional 
stopping theorem: 
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Theorem A.1.1 (Optional Stopping Theorem). Let X t be a martingale and let r be a 
stopping time. Let a least one of these 3 hold: 

1. P(r < oo) = 1, 

2. E|A r | < oo 


3. E 


\XtW>t] 


0 as t. —>■ oo. 


Then the so-called stopped process 


X tAr = X 


min (t,r) 


is a martingale. 

An example application of the optional stopping theorem is Gambler’s ruin: Let 

r = inf{f > 0| W t £ (a, b )} , 

where 0 < b < oo and — oo < a < 0. Then P(r < oo) = 1 and 
0 = Wo = E Wtr\ T 

—> aP(W T = a) + 6(1 — P {W T = a)) as t —»• oo 
= (a - b)¥(W T = a) + b, 

which can be simplified to get 

P(W T = a) = . 

b — a 

A.2 Local Martingales 

First dehne a local martingale: 

Definition A.2.1 (Local Martingale). A process Xt is a local martingale if there exists a 
sequence of finite and increasing stopping times r n such that P(r n —>• oo as n — > oo) = 1 
and X t /\ Tn is a true martingale for any n. 

Some remarks are in order: 

Remark 3. In discrete time there are no local martingales; a martingale is a martingale. 

Remark 4. A true martingale X t is a local martingale, and any bounded local martingale 
is in fact a true martingale. 
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The Ito stochastic integral is in general a local martingale, not necessarily a true mar¬ 
tingale. That is, 

It — I ffudWu i 
Jo 

is only a local martingale, but there exists stopping times r n such that 


rt/\T„ 


ItAr n — 


(T u dW u , 


is a true martingale. For Ito integrals there is the following theorem for a sufficient (but 
not necessary) condition for true martingales: 

Theorem A.2.1. The ltd integral a u dW u is a true martingale on [0, T] if 

r T 

E J cr 2 s ds < oo , 

i.e., the ltd isometry is finite. 

For the stochastic integral fi = [q a u dW u we can define 

T n = 


r 

ft ) 

f > 0 

J ^ erfds > n 


for which we have a bounded Ito isometry, and hence Theorem A.2.1 applies to make ItAr n 
a martingale on [0, T]. 

An example of a local martingale is the constant elasticity of volatility (CEV) model, 

dS t = oS?dW t , 

with 0 < a < 2; S'* is strictly a local martingale for 1 < a < 2. For a = 2 one can check 
using PDEs that the transition density is 


Pt{z\s) = 


z 3 V2nta 2 


JM) 2 

g 2 ter 2 — g 2 ter 2 


One can check that ES’f = oo for all t > 0 so that Theorem A.2.1 does not apply, but to 
see that it is a strict local martingale one must also check that EfS’r < St for all t < T. 
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Appendix B 

Some Notes on Fourier Transforms 


For a real-valued function f(x) with \f(x)\ 2 dx < oo, Fourier transforms are defined as 
follows 


/(“) = 


e lux f{x)dx 


i r°° 


e~ mx f(u)du . 


(B.l) 

(B.2) 


The Fourier transforms in (B.l) and (B.2) are somewhat different from traditional defini¬ 
tions, which are 


/(«) = 
/(*) = 


—2niux 


f(x)dx 


— OO 

coo 


e 2niux f(u)du . 


The reason we choose (B.l) in finance is because we work so much with probability theory, 
and as probabilists we like to have a Fourier transform of a density be equal to a char¬ 
acteristic function. However, nothing changes and we are able to carry out all the same 
calculations; the difference amounts merely to a change of variable inside the integrals. 


B.l Some Basic Properties 

• Linearity: if ijj(s) = af(s ) + bg(s), then 

${u) = af(u ) + bg(u) . 

• Translation/Time Shifting: if f(s) = ip(s — so), then 

f(u) = e iuso $(u) . 
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Convolution: for / * g(x) = f(y)g(x - y)dy , 


/ * 9(u) = 


poo roc 

Aux I 


f(y)g(x - y)dydx 

5 

/ oo 

e^-v) g {x - y)dxdy 

-OO 

= / e luy f{y)g(u)dy 
J —OO 

= f(u)g(u) . 


' — oo 
roc 


' —oo 
/*oo 


Reverse Convolution: 


/*sW = ^/_ 


oo /»oo 

—iux I 


f(v)g(u — v)dvdu 


hi 


— OO 
OO 


2vr 

= 


— OO 
OO 


7(u) / 

J— oc 
Vx f(v)dv 




g(w — v)dudv 


= 2tt g{x)f(x) . 


An important concept to realize about Fourier is that functions (e mi ) ug ]R can be thought 
of as orthonormal basis elements in a linear space. Similar to finite-dimensional vector 
spaces, we can write a function as a some inner products with the basis elements. Indeed, 
that is what we accomplish with the inverse Fourier transform; we can think of the integral 
as a sum, 

i /*°° i A. 

m = 5- / e-**f(u)du « ^ ;r , 

2w J - ~ 2 ' „i 

where u n are discrete points C (this is a similar idea to Fast Fourier Transforms (FFT)). 
The basis elements are orthogonal in the sense that 


1 

27T 


e lux e- iVX dx = 5 0 {u - v) 


Finally, it needs to be pointed out that the function Jo is something that defined under 
integrals, and is rather hard to formalize outside. For instance, in Parseval’s identity, we 
used do, but only inside the integral: 

Proposition B.1.1. For a real-valued function f{x) with \f(x)\ 2 dx < oo, 



2i L l/( “ )|2rf “ 
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Proof. 


1 

2tt 



1 

2n 


2tt 

1 

2n 


/ OO —- 

f(u)f(u)du 

-OO 

| /*oo /*oo POO 

/ / e iux f(x)e- iu yf(y)dxdydu 

J —OO «/— c 


r — oo «/ —oo «/ —OO 

/*oo /*oo 


—oo ./ —oo 
oo roo 


' —oo </ —oo 
/*oo 


f(x)fiy) 

f{x)f{y)5 0 (y - x)dxdy 
f(x)f(x)dx . 


e inx e -iny du dxdy 


□ 


B.2 Regularity Strips 


For some function / : M —> M we modify (B.l) and (B.2) to accommodate non-integrability 
and/or non-differentiability. We write 


/ OO 

e iux f(x)dx 

-OO 

i piz+oo 

fix) = tt- e~ iux J{u)du . 

^ J iz—oo 


where z = Q(u). The region of 2 values where the Fourier transform is defined is called the 
regularity strip. This comes in handy for functions like the European call and put payoffs. 

Example B.2.1 (Call Option). For a call option, f(x) = ( e s — K) + . This function has 
/logA' \f( s )\ 2 d s = 00 an d non-differentiability at s = log(A'). However, we can still write a 
Fourier transform: 


/ OO 

e ius (e s - K) + ds 

-OO 


f 


—OO 

oo 


s-\-ius 


ds-K 


log K 
s-\-ius oq 


Mog K 


1 + iu 


/ie 1 ' 


s=log K 


XU 


oo 

s=log K 


where the anti-derivative of e lus and e s+lus are the same as they are for real variables 
because the function is analytic in the complex plain. For z = Q(u) > 1, the anti-derivative 
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will be zero when evaluated at s = oo, will be infinite for z < 1, and undefined for z = 1. 
Taking z > 1, we have 


/(«) 


e (H-iu) log(E') j£ e iulog(K) 

1 + iu iu 

(' e iog(E')^( 1 +™) K ^ e \og(K)y u 

1 + iu iu 

j^l+iu 

~ o ? 

ZZZ — U z 


as shown in Table^T 7] o/ C7iapfer[?| 


Example B.2.2 (PutOption). For a put option, f{x) = (iP — e s ) + . This function has 
JiogK \f( s )\ 2 d s < °°j Z*A;e P u t option is non-differentiability at s = log(i^). Repeating 
the steps from the previous example, it follows that the regularity strip is z = 7s(u) < 0 (see 
Table 7.1 of Chapter^). 


B.3 The Wave Equation 


Consider the wave (transport) equation, 

!r(t,*) = cjw{i,x) 

V(0,x) = f(x) . 

The solution can be found with Fourier transforms: 



or simply 

Q ^ ^ 

—V(t,u) = -aiuV(t,u ) , 

V(0,u) = m . 

The solution is 

V(t,u) = e~ aiut f(u) , 
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which we invert to obtain the solution to the wave equation, 


V(t,x) 


g—iuXg—aiut 


f(u)du 


— OO 
’OO 


1 

2ITT 

— / e- iu(x+at) /(n)^ 

.7 — OO 

f(x + at) . 


The lines x = c — at are the characteristic lines so that for any c we have v(t, c — at ) = /(c); 
the initial information /(c) at point c is propagated along the characteristic line. 


B.4 The Heat Equation 

Consider the heat equation 


JV(f,x) = V(t,x) 

V(0,x) = f{x) , 

where and \f(x)\ 2 dx < oo. For scalar x we have A = and the inverse Fourier 
transform yields an ODE under the integral sign, 


d 1 d 2 \ i r°° / d 

m - SS?) v =sLs-==l 


1 

27T 

1 

“ 27T 
= 0, 


— OO 

/»oo 


1 (} 2 \ 

I —lUXl 

dx 2 y 

2 ' 


y + y ) V{t,u)du 


or simply, 


The solution to (B.3) is 


jV(t,u) = ~V(t,u) 
V(0,u ) = /(«) . 

V'(*,«) = /(«)e _I H . 


(B.3) 
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Applying the inverse Fourier transform yields the solution, 
V(t,x) = — f e~ lux V(t,u)du 

./-oo 

-I /•oo ^ 2 

= — / e~ iux f(u)e~ y ^ t du 
Z'Kj-oo 


r*oo / /*oo 

—iux I I 


e luy f{y)dy ) e “2 f du 


f{y) /°° dy 


y/tortJ-oo' K "' \\2 kJ-oo ) * 

= — ^ = J f(y) ^E , e tZ ( y ~ x ')/'^'j dy where Z ~ IV(0,1) 

= 7^tL f{y)e 2t dy - 

In fact, it can be seen as the expectation of a Brownian motion, 

V(t,x) =Ef(x + W t ) 

where Wj is a Brownian motion. If /(x) = 6o(x), then the we have the fundamental 
solution, 

1 _x 2 _ 

$(t,x) = - - e 2 1 

V2 jit 

from which solutions for different initial conditions are convolutions, 


/ OO 

/(y)<F(f,x-y) 

-OO 


B.5 The Black-Scholes Equation 


The Black-Scholes model is 


C -^~ = rdt + crdW® 


where W® is a risk-neutral Brownian motion. Letting Xf = log (St) the Black-Scholes 
equation is 


d a 2 d 2 ( a 2 \ d 


m + Yw + { r -Y)dx- r ) nt,*) = o 

V(T,x) = ^(x) , 


101 



where is the claim, e.g. a call option = ( e x — K) + . Using the same Fourier 

techniques as we did with the wave and heat equations, we have 


d_ 

dt 


a 2 u 2 


iu ^'r ——^ — r^j V (t,u) = 0 

V(T,u) = $(u) 


which has solution 

V(t,u) =^(u)e _(T ^ ) (^ +i “( r_ ^) +r ) . 

Applying the inverse Fourier transform, we arrive at the risk-neutral pricing formula (i.c 
Feynman-Kac), 

V(l.x) ‘ / e~ iux V(t,u)du 

2vr y„ 00 

p—r(T—t) poo 


2 ^" J —oo 

0 —r{T—t) poo 


2tt j —qq 
0 —r(T—t) poo 


f°° e~ iux ^(u)e (T 2 “ +iu ( r ^ ))du 

J — OO 

r e~ iux ( r e iuy 'tp(y)dy\ e _(T “ t) (^ +iu ( r “ V )) 

■J —oo \J —oo / 


du 


/°° r ^eHy-^-^-^e-^-^dudy . 

J —oo */ —OO 

Take change of variable y' = y — x — (T — t) (V — such that dy' = dy , then 


U(f,x) = 


a —r(T—t) poo 


2vr 

e -r(T-t) /-oo 


-V-^dudy' 


2vr 

= - 0 /I * (✓ + * + ( r - 1 ) (r - £) ) ( Ee 

e -r(T-t) r°° f / (j 2 \\ _I 

= ^w^ 7 ) J-Jv +x+iT - t) \:-v) e 2 

= e - rfT - , ' l E Q ip (x + (T - () fr - y 1 - tV, 0 )! 

= e" T ' (T " t) E Q U (A t ) 


IlI-j{ y,+x+iT - t) { r -^)) e,w 

j°° ip (y + x + (T - t) - y)) (y e iuy 'du'j dy 1 




dy' 


(y'f 

°H T -t) dy 


X t = x 
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